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As part of the evaluation of the effectiveness of full-year Head 
Start programs, children from a nationwide sample of centers were 
tested or rated with five different instruments. The centers were se- 
lected to be representative of programs of different lengths. Testing 
was performed at the end of the programs. Analyses were made to 
test the hypothesis that the length of the program affects the perform- 
ance of the children. There was no reliable evidence of a systematic 
relationship between length of participation in a program and level of 
performance or development as measured by the Peabody Picture Vo- 
cabulary Test (PPVT), the Pre -School Inventory (FSI), or the Vineland 
Social Maturity Scale (VSMS). There was a significant decrease in 
scores for intermediate -length program children on the Behavior Inven- 
tory (BI); however, that result did not appear to be directly related to the 
effects of the program on children. Factors affecting the interpreta- 
tion of results are discussed in this report. 

Descriptive statistics on the test scores, characteristics of 
parents and families, and characteristics of Child Development Center 
(CDC) staff members are presented. Intertest correlational data are 
also reported. Data and analysis of Draw-A-Person (DAP) tests are 
also presented. 
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I. INTRODUCTION 



A. Background 

The study reported here was undertaken as part of a national pro- 
gram of evaluation of the 635 1966 full-year Head Start programs. When 
the study was initiated, a major data collection effort had already been 
designed and implemented by the U. S. Bureau of the Census. It was the 
task of the Census Bureau to obtain from a national sample of Child De- 
velopment Centers (CDC's), children, and parents, standardized infor- 
mation about: 

• The age, sex, and race characteristics of the children. 

• The medical and dental history and status of the children. 

® The characteristics of staff members and workers in the 

sample CDC's. 

• The evaluation of the individual programs by staff members 
and workers. 

• The participation of parents in the local programs. 

• The structure and other socio-economic characteristics 
of the families of the children. 

The data collected by the Census Bureau have provided, for the 
most part, important descriptive information about the characteristics 
of various elements of the full-year programs. They will not (and were 
not intended to) provide measures of the performance or achievement 
of the children served. It was the purpose of the present study to fill 
this need partially and thus to supply an important dimension in the 
evaluation of program effectiveness at the national level. 

B. Report Organization 

This report is organized primarily in two parts. Sections II 
through VI describe the study and present and discuss results. Appen- 
dixes A through F provide amplifying and supplementary material. In 
Section II, the objectives of the study are stated. Section III provides a 
description of the study design, the tests used and measures obtained, 
the experimental design, and the data collection procedures. Section IV 



PRC R-886 
2 



presents the results, Section V discusses these results, and Section VI 
lists conclusions. The appendixes provide details on the sample centers 
and sample universes (Appendix A), the statistical models (Appendix B), 
the training and supervision of testers (Appendix C), the tests (Appendix 
D), a subsidiary analysis of high- and low-scoring centers (Appendix £), 
and the Draw-A-Person test results (Appendix F). 
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n. STUDY OBJECTIVES 



The primary objective of this study was to evaluate the effective- 
ness of 1965-66 full-year Head Start programs for culturally deprived 
pre-school children. Unlike the summer Head Start programs, which 
ran for approximately 8 weeks, the full-year programs during the first 
full year varied enormously in starting times, ending times, and 
length. Some programs commenced in the fall of 1965. Others were 
still coming into being in May 1966, when this study was initiated. 

Some programs ended in March or April of 1966; others did not end 
until January or February of 1967. Some were conducted for over 40 
weeks, and others for less than 10. This very diversity offered an 
opportunity for an evaluation of the effectiveness of the programs that 
was quite impossible with the short, fixed -length summer Head Start 
programs. From the point of view of program planning and resource 
allocation, it would be extremely valuable to know something about the 
relationship between length (as well as kind) of program and output, per 
formance, or effectiveness. 

This study undertook to accomplish the objective stated above by 
testing the hypothesis that the length of a Head Start program affects 
the level of performance, achievement, or behavior of Head Start 
children. 

A secondary objective of this study was to provide further descrip 
tive data on the performance and behavioral characteristics of Head 
Start children in cognitive, social, and emotional areas of development. 
As part of the accomplishment of both objectives. Planning Research 
Corporation (PRC) also wished to examine some possible relationships 
between workers and programs on the one hand, and the performance 
of the children on the other. 
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III. APPROACH 



A. General Approach 

The general approach of the study was to administer tests to 
samples of children in programs of different durations, and to obtain 
ratings of the behavior and abilities of the children from their teachers 
in the selected Child Development Centers. The principal independent 
variable was the length of the individual program; the dependent var- 
iables were the performances or ratings of the children as measured 
by scores on the tests or scales. While there was generally a strong 
correlation between the length of the program and the length of time 
that a given child had been in the program, the two variables are not 
synonymous. Consequently, two main types of analyses had to be 
made, based on these two separate duration or exposure variables 
(see Section IV). 

The major methodological constraint on the general study de- 
sign resulted from the timing of the study. The study was initiated in 
May 1966 , by which time pre-testing of children was impossible. Con- 
sequently, the overall approach was based on end-testing of samples 
of children in sets of programs of different durations. Thus, the de- 
sign was essentially that of Campbell and Stanley's Design 6, except 
that of course prior assignment of children to treatment levels was 
not randomized. Three main levels or durations of program were 
considered. These levels are labeled Short-term, Medium-term, 
and Long-term (S, M, and L) and are defined precisely in subsection 
III. C below. The sampling universe of programs consisted of a sub- 
set of CDC's in the Census Bureau's nationwide sample. The rules 
for defining and identifying this subset are also discussed below. Sam- 
ple sizes, both for number of children and number of centers, were 

^Campbell, Donald T. and Julian C. Stanley, "Experimental and Quasi- 
Experimental Designs for Research on Teaching." Chapter 5 in Gage, 
N. L. (ed. ), Handbook of Research on Teaching. Chicago: Rand 
McNally, 19^3^ 
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based on estimate variances for the Peabody Picture Vocabulary Test 
(PPVT) from data obtained in the 1965 summer Head Start research 
and evaluation program.^ 

B. Tests 

The primary instruments used in the testing program were the 
Peabody Picture Vocabulary Test (PPVT) Form B, the revised Pre- 
School Inventory (PSI), the Operation Head Start Behavior Inventory 
(BI), and the Vineland Social Maturity Scale (VSMS). A Draw-A- 
Person (DAP) test was also administered. 

The PPVT is an individual test of verbal ability which does not 
require a verbal response; for example, the tester shows a child a 
page containing four pictures and asks, "Show me 'table'." There are 
150 possible pictures which the child may identify; the "raw score" is 
essentially the number of correctly identified pictures and can be con- 
verted into mental age or intelligence quotient equivalents if desired. 

2 

Form B of the test was used throughout this study. 

The revised PSI is an individually administered 85 -item meas- 
ure of school readiness originally developed for the 1965 summer Head 
Start program by Dr. Bettye Caldwell and Mr. Donald Soule of the Nev/ 
York State University at Syracuse. The test provides a total score, as 
well as four subtest scores of separate factors related to school readi- 
ness: personal-social responsiveness, associative vocabulary, numer- 
ical concept activation, and sensory concept activation. The tester 
adm "isters all items to the child; the highest possible score is 90, as 
five items have a score value of 2 points, while all other items have a 
score value of 1 point. The total raw score can be broken down into 
scores for each of the four factors or subtests. 



Planning Research Corporation,. PRC R-795, Results of the Summer 

1965 Head Start Program , Volumes I and II, H. R. Cort, Jr. et al, 

1966 (Unclassified). ^ 

2 

Dunn, Lloyd M. , Expanded Manual, Peabody Picture Vocabulary Test . 
American Guidance Service, Inc., 720 Washington Ave. S.E., 
Minneapolis, Minnesota, 1965. 
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The BI is an instrument originally developed by Dr. Edward 
Zigler for the 1965 summer Head Start program. It is a set of 50 rating 
scales which are intended to obtain information on a number of behav- 
ioral aspects of children. Twenty-five of the scales are intended to tap 
positive behavioral characteristics (such as ”Is usually carefree; ra *ely 
becomes apprehensive or frightened") and 25 are intended to tap negative 
characteristics (such as "Has little respect for the rights of other chil- 
dren; refuses to wait his turn, usurps toys other children are playing 
with," etc.). The teacher, or someone else who knows the child well, 
rates each child in her class on each of the 50 items or scales. The 
BI can provide an overall behavior-adjustment score and/or a separate 
adjustment score for each of nine behavioral categories (sociability- 
cooperation-politeness; independence -dependence; curiosity-enthusiasm- 
exploration- creativity; persistence; emotionality; self-confidence; 
jealousy-attention- seeking; achievement; and leadership). 

The VSMS is an interview schedule which is given to someone 
who knows the child well; it provides an indication of the child's social 
development, maturity, and independence. First developed by Dr. 

Edgar A. Doll in 1935, the VSMS has a 1965 edition which was used 
throughout this testing program. The scale attempts to evaluate the 
child in eight different areas: self-help, general; self-help, eating; 

self-help, dressing; locomotion; occupation; communication; self- 
direction; and socialization.^ The VSMS provides a total score and 
conversions to Social Age (SA) and Social Quotient (SQ) can be made. 

The DAP (or Draw-A-Man, as originally conceived by Dr. Florence 
Goodenough in 1926) is a simple test of intelligence in which the tester 
asks the child to draw a person. The drawing is later scored against a 
detailed list of criteria and total points for each drawing are converted 
into standard scores (IQ's) or mental age equivalents. In this study the 
children's drawings were scored according to the criteria delineated 



jL 

^Doll, Edgar A. , Vineland Social Maturity Scale, Condensed Manual 
for Directions, 1965 Edition" American Guidance Service, Inc., 

720 Washington Ave. S.E., Minneapolis, Minnesota, 1965. 
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by Harris for male and female drawings. See Appendix F for further 
details concerning administration of the DAP. 

The results of the first four tests are included in the body of the 
report and in the main statistical analyses described below; DAP re- 
sults are reported in Appendix F. 

C. Experimental Design 

As stated earlier, the overall design of the study was an end- 
testing of children in full-year Head Start programs, with the length 
of the program used as the main experimental variable and test scores 
of children used as the main dependent variables. The study was de- 
signed essentially for a straightforward one-way analysis of variance. 
Details of the sampling designs and procedures are described below. 

1 . The Universe 

Because the design variable was the length of the program, 
it is necessary to define starting dates, ending dates, and weeks. To 
this end, Friday is the last day of the week and the week of 14 to 20 
Is/Iay 1966 is week zero. (There are also some negative weeks.) 

The Census Bureau supplied PRC with early estimates of the 
starting dates and the closing dates for (potentially) every full-year 
Head Start center. These early estimates are the dates used in de- 
fining the universe. The universe comprises all of the children in 
certain kinds of centers. The rules for including or excluding centers 
are discussed below. 

If a center's starting date was in week j and its ending date in 
week i , then the center is said to have an i - j week program. Cen- 
ters ending before week 1 and after week 12 are not in the universe. 

Of the remaining centers, only those are in the universe whose pro- 
grams are weeks: 



^Harris, Dale B. , Children's Drawings as Measures of Intellectual 
Maturity. New York^ Harcourt, Brace, and World, 1963. 
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11. 


12, 


13, 


14, 


15 


(S) 


19. 


20, 


21, 


22, 


23 


(M) 


27, 


28, 


29, 


30, 


31, 32, ... 


(L) 



The three groupings define a partition of the universe into three sub- 
universes called S, M, and L. Knowledge of the universe was derived 
primarily from the Census Bureau sample; therefore, further discus- 
sion must await the introduction of this sample under subsection III. C. 3. 

2. Sample Size 

2 2 

If or A is taken to be the variance between centers and O’ 

A e 

the variance within centers, then the variance of the estimate of the 
mean of a subuniverse is (approximately) 




m mn 



(1) 



where n is the number of children sampled from each of m centers. 
Expression (1) emphasized the importance of including as many centers 
as possible in the sample; in fact, the inadequacy of early suggestions 
for the number of centers was argued with a series of sums, the ter- 
minal one of which is reproduced below. 

For the two variances, variance component estimates are taken 
as values from the summer 1965 analysis of PPVT scores--i.e. , set 

= 60.1 (2) 

and = 130.9. (3) 

e 

Then, the variance of the estimate of one of the treatment (S, M, or 
Li) means for 12 children sampled from each of 24 centers is 2.958. 
Thus, the noncentrality parameter for the chi- squared test of equality 
of treatment means if, for example, the actual means are 36, 39» and 
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43, is 8.26, which yields a power of 0.73. Thus, mean differences on 
this order should show up quite well. 

The power of the design against monotonic mean increases in 
program length could have been increased by concentrating centers 
more on the S and L ends, but the risk of observing an inexplicable 
M average would have been too great. 

Some gain in the power to distinguish among S, M, and L will 
result from using four scores, even though they are correlated, rather 
than just the PPVT. 

3. Mechanics of Drawing Centers 

The Census Bureau drew a sample of centers from a uni- 
verse which is larger than but includes PRC's. When deletions were 
made from the Census Bureau sample of those centers which were in 
the Census Bureau's universe but not in PRC's, a second Census 
Bureau sample--one from PRC's universe--was constructed, as is 
shown in the first three exhibits of Appendix A. The PRC sample is 
a subsample of the second Census Bureau sample. It should be noted 
that PRC had to work with an early list that listed 428 centers in the 
Census Bureau sample; the Census Bureau sample contains somewhat 
more than 428 centers. 

In drawing its sample, the Census Bureau used an early estimate 
of the number c of classes in the center. The Census Bureau pro- 
vided PRC with these c's and chose centers with probability 

1/4 for c i 5 

c/20 for 6 i c ^ 19 

1 for c> 20. 
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PRC subsampled centers with probability 

c/T . for c s 5 
J 

5/T. for 6 5; c ^ 19 
c/4Tj for c > 20. 

As a result, centers entered the PRC sample with probability 

c/4T. 

where Tg = 155/24 

Tm = 99/24 
T, = 134/24 

These selection probabilities were used in order to give each 
child in one of the S, M, or L subuniverses an equal probability of 
selection, since approximately one class of children per center was 
tested by PRC. 

Sampling of centers was systematic with a random start (see 
Appendix A). The lists were ordered by region as well as program 
length in the hope that variance could be reduced. This was suggested 
by the Census Bureau's Mr. Robert Hanson, who was most generous 
with suggestions for the drawing of the PRC sample as well as details 
of the Census Bureau draw. 

Those interested in the geographic distribution of full-year Head 
Start centers would be well advised to look at the quite complete de- 
scription of the PRC universe given by the second Census Bureau 
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sample listed in Exhibits A-1, A-2, and A-3 of Appendix A. The geo- 
graphic distribution of centers in this universe is different from the 
textbook kind. 

4. Replacement of Centers 

A number of centers in the original draw had to be re- 
placed, principally because they ended their program so much earlier 
than the early estimated date that they could not be included even with 
the information obtained by phone. (A great deal of rescheduling was 
successful because of the phoning of every center.) Some centers were 
also replaced because the children had been PPVT -tested two or more 
times already. 

The first two centers were replaced ad hoc by centers under the 
same grant; however, since it soon became clear that a more easily 
explainable replacement scheme was needed, one was worked out. 

Each center in the second Census Bureau sample was given a replace- 
ment week number, which i:an differ by one from the ending week num- 
ber since replacement weeks end on Thursday. Centers were then re- 
placed, if necessary, by a second Census Bureau sample center in the 
same OEO region and with a replacement week number as large or 
larger than the replacement week number of the center to be replaced. 
Within these possibilities, probability of choice was as described in 
the previous section with a slight adjustment to overcome the bias be- 
cause of the further conditioning. 

5. Mechanics of Drawing Children 

As explained in subsection III. C. 3, the selection of about 
12 children per center resulted in each child in an S, M, or L sub- 
universe entry in the sample with the same probability. 

In addition to avoiding bias, another PRC aim in the drawing of 
children was to include as many children as possible from the Census 
Bureau sample of children. Since the Census Bureau chose every 
fifth child in every sample center, PRC usually chose more, but some- 
times fewer, children from a center. A detailed set of instructions 




PRC R-886 
13 



was prepared and issued to the testers for the drawinjf of a systematic 
sample augmenting, diminishing, or, if necessary, ignoring the Census 
Bureau sample with the random start appropriate to a number of dif- 
ferent possible lists or orderings of children. It was usually unneces- 
sary for the testers to use these instructions. However, even when 
they were not used, they did serve the purpose of impressing on the 
testers that the choice was not theirs and that the testers should phone, 
if necessary, for instructions specific to the list at hand. For most 
centers it was possible (since the Census Bureau random start had 
been obtained by phone along with class number and si 2 .e information) 
to provide the testers in advance with a list of 15 child identification 
numbers which were Census children, in addition to random choices 
from within classes. An additional five child ID*s were provided as 
spares; in some cases, the testers had to phone for more. Testers 
had to explain all failures to test before being allowed to use spares. 

D. Primary Data Collection 

1 . Selection and Training of Testers 

Candidates for Head Start testers were drawn from several 
sources. They included Washington area universities, Boston area 
universities, the Personnel Placement Office of the Peace Corps, Bryn 
Mawr College, the Professional Placement Office of the U.S. Employ- 
ment Service, and various Planning Research Corporation contacts. 
Approximately 200 candidates were interviewed. Emphasis was placed 
on selecting those individuals who had the following qualifications: 

• A college degree in a field related to education, sociology, 
psychology, or guidance. 

• Experience with pre-schoolers or disadvantaged children. 

• Language fluency in Spanish as well as English. 

• A flexible summer schedule. 

Each individual hired was required to provide a recommendation from 
one of his college instructors. 

Twenty-five testers were trained in two training sessions (12-15 
May and 8-11 June 1966) held at the Center of Adult Education at the 
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University of Maryland. Both training programs lasted for 3-1/2 days 
and covered the following: 

• Orientation to the project and to test administration. 

• Tester-child relationships with emphasis on the culturally 
deprived child. 

• Adult -adult relationships. 

• Role-playing in possible test situations. 

• Procedures in data collection and scoring. 

• Practice in testing young children. 

• Practice in interviewing. 

Testers worked from 1 to 7 weeks in the field but not continuously. 
Some testers were able to devote a major portion of their summer to 
the testing program and thus provided a corps of trained testers with 
considerable experience in testing Head Start children. Quality and 
uniformity in the testing program were controlled by frequent field 
supervision, as explained below. (See Appendix C for a more detailed 
description of the selection, training, and supervision of testers.) 

2. Individual Tests 

The three individual tests were always given in the follow- 
ing order: PSI, PPVT, and DAP. The most quiet and private places 
available were used for the individual tests. The tester began the test- 
ing by taking time to put the child at ease and establish rapport. The 
child received all three tests in one sitting, and the tester allowed time 
for "breaks" and relaxation during testing and between tests. 

Of course some children were unable to be tested. If a child 
would not respond to the tester, despite efforts to establish rapport 
and gain the child's cooperation, a notation was made on his record 
that he was "untestable" and another child was selected for testing. 
Testers generally devoted from 20 to 60 minutes to a child before de- 
ciding that he was untestable. (See page 23 and Exhibits 4 and 5 for a 
description of untested and untestable children.) 
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3. Rating Scales 

After the tester had tested a child, he arranged for an 
interview with the teacher (or teacher-aide) about the child. During 
the interview, which was carefully scheduled to elicit information le- 
quired by the scale, the tester completed the Vineland Social Maturity 
Scale. The tester also gave the teacher a blank BI for each child tested 
and encouraged the teacher to make the ratings that afternoon or evening. 

The tester was responsible for the collection of the 5-piece set 
of data on each child (PSI, PPVT, DAP, BI, and VSMS), as well as for 
the identification of the children to be tested. The latter was based on 
a sampling procedure provided the tester. In no case known did testers 
select or test children on any basis other than the sampling procedures 
given them, as described above. 

4. Testing Supervision 

In order to ensure a high quality of testing, each tester 
was observed at least once and in many cases several times during 
his field work. Two PRC staff members (Naomi H. Henderson and 
Ann O'Keefe) with extensive experience in testing and field testing served 
as field supervisors. They observed the testers during actual test sit- 
uations, and they sat in on Vineland interviews with teachers. No com- 
ments or interruptions were made during the actual testing or interviewing 
however, upon conclusion of a session, the supervisor conferred with the 
tester privately and emphasized those facets of test administration which 
would increase the uniformity and quality of the tester's testing. Ob- 
servations of tester-child situations were arranged so that the super- 
visor was out of the line of vision of the child, and thus detracted as 
little as possible from a normal test situation. 

5. Spanish-Speaking Children 

There were enough Spanish- speaking children in the sam- 
ple to warrant special testing procedures. The presence of Spanish- 
speaking children in a CDC was ascertained, and testers fluent in 
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Spanish were sent to those locations. When a tester encountered a 
Spanish- speaking child, he administered the tests in Spanish^ and an 
appropriate notation was made on the child's test data. (See Exhibit 
31 for a description of the sample's composition in terms of chil- 
dren who were tested in Spanish because their primary language was 
Spanish.) 

E. Supplementary Data Collection 

Since the sample of CDC's in which testing of children was con- 
ducted fell entirely within the national sample of CDC's defined and 
identified by the Bureau of the Census for its data collection, it was 
feasible to use information about staff members, programs, and fam- 
ilies obtained in the Census Bureau's study. In particular, PRC de- 
pended upon the Census Bureau to provide data from the following: 

• Staff Member Information Form (CAP-HS Form 43) 

• Paid and Voluntary Workers' Evaluation Form (CAP-HS 
Form 38) 

• Family Information Form (CAP-HS Form 46) 

The Family Information Form was mailed by the Census Bureau 
to the parents of children in its sample. Additional mailings were 
made by the Census Bureau to the parents of children in PRC's sam- 
ple who were not included in the Census Bureau sample. 



1 



See Appendix D for the Spanish translations used. 
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IV. RESULTS 



This section presents the results of the study with little or no 
discussion or interpretation. The data and analyses to be presented 
are intended to serve two purposes: (1) to support the fulfillment of 

the objectives of the study, and (2) to contribute to the pool of descrip- 
tive data about children, parents, and workers participating in Project 
Head Start, and about the instruments used to access them. 

A, Composition of Resultant Samples 

1 • Children 

As described in subsection III. C, samples of CDC's were 
drawn to represent programs of three different levels of program 
length. These three levels have been designated Short-term, Medium- 
term, and Long-term, and hereafter shall be referred to for conven- 
ience as S, M, and L. The initial designation of S, M, or L CDC's 
was based on records obtained from the Census Bureau, which listed 
each CDC in its sample as of 1 May 1966. The listing included starting 
and ending dates of each CDC. 

As stated earlier, however, while length of program operation 
is a fairly good measure of the length of time that children have been 
in the program, the two variables are not perfectly correlated. Fur- 
thermore, it was found that individual programs in some cases had to 
modify or shift their operating schedules. These two factors have led 
PRC to (1) make adjustments in its classification of centers by S, M, 
or L, and (2) distinguish between two types of children in resultant 
samples. 

a. Child Development Center Classification 

CDC's were classified S, M, or L on the basis of the 
number of weeks the program had been in operation from opening day 
until the end of the week of testing. Centers classified S were those 
of 15 weeks' length or less. M type centers were those of 17 to 23 
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weeks' length; L type centers were 2 5 weeks or more in operating 
length as defined. Most CDC's were tested within 2 weeks of their 
closing time. In no case was a CDC tested more than 3 weeks prior 
to its closing date. Holidays (except for a 2-week period around 
Christmas) were not taken into account in calculation of the length of 
the program or the time in the program, nor was the daily length of 
the program considered, 

A total of 72 CDC's was tested. The location and classification 
of these by S, M, and L are shown in Exhibit 1, It should be noted 
that one CDC fell between the intervals for S and M centers in program 
duration. The principal adjustments in classification of CDC's came 
with centers originally classified as L centers on the basis of the 
Census Bureau listing cited earlier. Three of the original 24 L cen- 
ters were reclassified as M centers. The M center that fell between 
the S and M intervals was dropped from consideration for analyses 
based on an S, M, or L classification. Two S centers were dropped 
from most statistical analyses, since all children in the center samples 
were tested in Spanish, Thus, for analyses of results by S, M, or L, 
there are a total of 69 CDC's, while for descriptive data on eligible 
children, the number of CDC's used is 71, 

It should be noted that, throughout the report, centers are re- 
ferred to by 2- digit numbers, ranging from 01 to 72, These numbers 
are consistent within the report, in that, for example, center 01 always 
refers to one particular center. However, they correspond in no sys- 
tematic way to the list of centers in Exhibit 1, The new 2-digit center 
identification numbers were assigned to prevent particular centers 
from being singled out, and to help maintain anonymity of centers dis- 
cussed in this report, 

b. Eligible Versus Non-Eligible Children 

In all, 964 children in the 72 CDC's were tested with 
the five tests and rating scales. These were, of course, children who 
(1) were identified by PRC's sampling procedure, (2) were available 
for testing during the week in which their CDC was tested, and (3) 
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were found to be testable. However, there was substantial variation 
in the number of weeks that children in given programs had actually 
been in attendance. Since PRC was interested in the effect of the length 
of the programs, it was found necessary to classify the children into 
two groups; eligible and non-eligible. 

The term "eligible” is used to designate that the child was in a 
CDC of the jth type (S, M, or L) and was in the center's program for 
the number of weeks falling within the jth type duration interval. Oc- 
casional illnesses or absences and holidays were not considered in 
calculations of eligibility. The term "non-eligible" or "dropped" is 
used to designate that the child tested had been in the program a fewer 
number of weeks (or in a few instances, a greater number of weeks) 
than those within the defined interval by which his center was classified. 
In effect, then, a child was given the S, M, or L label of his center 
for the purpose of most of PRC's analyses, but he was included in 
many of these analyses only if he was eligible. That is, in an analysis 
of effects by S, M, and L, a child from an M center who had only been 
in the program for a short time would not be included with S children; 
he would be dropped from the tabulation or analysis. The total sam- 
ples, whether for a center or for S, M, or L type centers combined, 
were the sum of eligible and non-eligible children. For the total com- 
bined S, M, and L sets of centers there were 831 eligible children, 
of whom 67 were tested in Spanish, 

c. Distribution of Children by Sex, Race, and Age 

Exhibit 2 shows the composition by sex, race, and 
age of the S, M, and L samples, based on all children tested (i, e,, 
eligible plus non-eligible). The percentages are percentages within 
each major sample. The number of children in each sex, race, and 
age category is given only for the finest breakdown to avoid cluttering 
the tables. 

Exhibit 3 shows the composition by sex, race, and age of the 
S, M, and L samples, based on all eligible children tested. Generally 
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in this report the classification "non-white" refers to Negro children. 
However, included in this classification are also two Oriental children 
and nine Indian children. 

d. Distribution of Children by Language Used in Testing 

Exhibit 4 shows the composition by sex, age, and 
primary language (English or Spanish) of the S, M, and L samples, 
based on all eligible children tested. 

e. Untestable and Untested Children 

As mentioned earlier (see page 14), some children 
in the original sample were unable to be tested. In some cases, the 
children were simply unavailable due to absence. In other cases, the 
child was untestable because he would not respond to the tester in the 
test situation. Testers generally spent from 20 to 60 minutes attempt- 
ing to gain a child's cooperation before deciding to consider the child 
untestable. 

Exhibit 5 presents a description in terms of sex, race, and 
age (where such information was available) of the number of children 
who were untestable (UT) or absent (A). Absent children included 
children who had never attended the program but had been registered, 
children who were ill, children who had been withdrawn from the pro- 
gram for a variety of reasons, and children whose families had moved. 
Untestable children included children who (1) refused to go with the 
tester to the testing room, (2) were extremely reticent, (3) were tear- 
ful and uncommunicative, (4) spoke unintelligibly, and (5) were un- 
manageable and hyperactive. 

Of the children who were selected for inclusion in the test sample, 
60 were eventually deemed untestable and 257 were found to be absent 
or unavailable during the week of testing at their center. 



^When a child's primary language was Spanish, he was almost always 
tested in Spanish by a tester fluent in the Spanish language. (See Ap- 
pendix D for Spanish version of tests. ) However, if a child's primary 
language was Spanish, but he was tested in English because in the 
tester's judgment he was not unduly restricted by the language, that 
child was not classified as "Spanish- speaking, " "tested in Spanish," or 
"Spanish." 
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EXHIBIT 5 - NUMBER OF UNTESTABLE AND UNTESTED (ABSENT) CHILDREN, 
BY SEX, RACE, AND AGE 
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(2) Children for whom neither race nor sex was indicated 

(3) UT indicates untestable. 

(4) A indicates absent or unavailable. 

(5) Children for whom age was not indicated. 



PRC R-886 

26 



Exhibit 6 shows the percentages of untestable children by sex, 
race, and age. In this exhibit, only untestable children for whom race, 
sex, and age were indicated (N =51) are included. (If all. 60 untestable 
children are included, the total percent of children considered untest- 
able rises from the 5.02 percent indicated in Exhibit 6 to about 5.9 per- 
cent. ^ Exhibits 5 and 6 show that 3- and 4- year-old children were 
more likely to be considered untestable than 5- and 6-year-old children. 

It also shows that boys tended to be more untestable than girls, and 
whites more than non-whites. In effect, the children with the highest 
untestable rates were young white boys. 

2. Staff Members - Distributions by Characteristics 

In this subsection, the total staff member population for 
S, M, and L centers will be described in terms of their position, selec- 
ted characteristics, and relevant experience. All data are from the 
Staff Member Information Forms (CAP-HS Form 43) collected from 
the selected Child Development Centers) by the Bureau of the Census. Head 
Start workers were classified according to the following four positions: 

• Administrative workers 

• Teachers 

• Other professionals (psychologists, physicians, dentists, 

and social workers) 

• Non-professionals (paid teachers' aides and volunteers) 

Exhibit 7 presents percentages of staff members by position for 

S, M, and L centers. In S centers, non-professional workers made 
up about 67 percent of the total staff, with teachers accounting for the 
next 21 percent. This same distribution pattern prevailed in M and L 
centers, with percentages reported for each staff position being pro- 
portionately the same. In all three levels, the administrative workers 
compribcd the smallest percent of the total staff population. 



^In all, 1,024 children were approached for testing, 964 were tested, 
and 60 were considered untestable. 



EXHIBIT 6 - PERCENTAGE OF UNTESTABLE CHILDREN BY SEX, RACE, AND AGE 
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EXHIBIT 7 - PERCENTAGE OF STAFF MEMBERS BY STAFF POSITION, 
FOR SHORT-, MEDIUM-, AND LONG-TERM CENTERS(1 ) 
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Reported information on age of staff members is shown in Ex- 
hibit 8. Percent figures indicate age by staff position for S, M, and L 
centers. The modal age of administrative staff in all center types 
was 45 years or over. In both S and L centers, teachers most fre- 
quently ranged from 22 to 3 0 years in age and accounted for approxi- 
mately 45 percent of all teachers employed. In M centers, however, 
the majority of teachers were between 31 and 45 years in age. With 
respect to other professionals, about 14 percent in M centers were 
under age 22, while S and L centers listed no "other professionals" 
at this age level. The age range most frequently reported for non- 
professionals was 31 to 45 years, but (as would be expected) a signifi- 
cant percentage in all center types was under age 22. 

Cultural and ethnic backgrounds of staff workers were also 
examined. Eight ethnic and cultural backgrounds were considered 
and Exhibit 9 lists resulting percentage distributions by staff position 
and center duration. About 17 percent of the S staff was Mexican/ 
American; this figure was influenced by the fact that a great majority 
of all Spanish- speaking children were enrolled in S centers. In both 
S and M centers, percentage distributions for total staff population 
were highest for whites, and Negroes had the highest percentage in 
I_i centers. However, it is clear that the relative proportion of Negroes 
and whites employed in each position varied considerably among the 
three categories of program duration. Negroes, for example, occupied 
the majority of administrative positions, except in M centers, where 
whites comprised 85 percent of the administrative staff. White teachers, 
on the other hand, were most frequently reported for all three categories 
of center duration, but sample percentages reported ^^ary greatly. The 
majority of non-professional staff members reported in S and L cen- 
ters was Negro. 

Exhibit 10 presents information on paid and volunteer staff mem- 
bers. Paid workers accounted for at least 76 percent of the total staff 
in each program level. Except in L centers, volunteers most frequently 
reported were the "other professionals," Teachers and administrators 
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EXHIBIT 10 - PERCENTAGE OF PAID VERSUS VOLUNTEER STAFF 

MEMBERS BY STAFF POSITION, FOR SHORT-, MEDIUM 
AND LONG-TERM CENTERS 



Program 

Duration 


Type of Position 


Total(l) 


Paid 


Volunteer 




Administrative Workers 


100.0 


87.0 


13.0 




Teachers 


100.0 


88.9 


8.4 


Short 


Other Professionals 


100.0 


40.2 


59.8 




Non-Pr ofes sionals 


100.0 


75.4 


24.6 




Total 


100.0 


76.0 


23.5 




Administrative Workers 


100.0 


66.1 


33.9 




Teachers 


100.0 


91.2 


8.8 


Medium 


Other Professionals 


100.0 


45.9 


54.1 




Non-Professionals 


100.0 


79.1 


20.5 




Total 


100.0 


76.5 


23.3 




Administrative Workers 


100.0 


100.0 


- 




Teachers 


100.0 


98.7 


1.3 


Long 


Other Professionals 


100.0 


100.0 


- 




Non-Pr ofes sionals 


100.0 


79.6 


20.4 




Total 


100.0 


88.4 


11.6 



Note: (1) Percentages may not add to 100 due to rounding 
or not being answered. 
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held the Hghest percentage of paid positions, except in M centers, 
where teachers and non-professionals comprised the greatest percent- 
age of the paid staff. 

Data on staff experience with pre-school children are presented 
in Exhibit 11. Again, figures listed are percentages by staff position 
for S, M, and L centers. The information reported indicates that the 
greatest percentage of the total staff population in both S and M centers 
had no previous experience with pre-school children. For administra- 
tive staff, a minimum of 5 years’ experience was most frequently 
reported in all levels. Teachers most frequently had from 1 to 3 yeavs’ 
experience while the modal percentage of non-professionals in two of 
the three levels had not worked with pre-school children before. The 
lower age level of the non-professionals is probably a prime factor 
contributing to that difference. 

In Exhibit 12 a second analysis is made on the basis of staff 
experience with children from poverty conditions. Again, percents 
represent staff positions for S, M, and L centers. Using these 
criteria, it can be seen that over one-third of the total staff employed 
in each center type had no previous experience with children from 
conditions of poverty. 

Overall percentage distributions by staff position within each 
level are much the same as those presented in Exhibit 11. Again the 
majority of administrators had over 5 years’ experience and teachers 
most frequently reported from 1 to 3 years’ experience related spe- 
cifically to children from poverty backgrounds. The percentage of 
non-professionals reporting no experience with children from conditions 
of poverty included well over one-third of the total non-professionals 
employed. 

3, Fa milies - Distributions by Characteristics 

Descriptive information about Head Start families was 
obtained by the Bureau of the Census from the Family Information 
Form (CAP-HS Form 46). Since many questionnaires were either 
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incomplete or not returned, all resulting data are based on the returns 
of 656 families.^ Exhibit 13 shows the percentage of families respond- 
ing for (1) all sample children in the 71 S, M, and L centers, based on 
the assumption of a family per child in the sample. 

Exhibit 14 summarizes the overall data relevant to both number 
of siblings and structural characteristics for families of children in 
S, M, and L centers. The average number of siblings in the S, M, and 
L center families was 3,76, 3.83, and 3.39, respectively. The informa- 
tion reported also indicates that for families of children in all three 
types of centers over 50 percent of the siblings were in the 6- to 15- 
year age range, while children under 6 years of age accounted for 
approximately 37 percent of the siblings. 

Mean family size varies slightly among different subsamples. Ex- 
hibit 15 (page 39) presents the mean family size for three subgroups of 

children: eligible English-speaking children, eligible English-speaking 

children living with their mother and father, and children tested in 
Spanish. For the first group--English- speaking eligible children 
(regardless of whether or not a father was living at home)--the mean 
family size was 6.69 people. The second group consisted only of 
English-speaking children who lived with both father and mother; as 
might be expected, the mean family size was slightly higher, 7.71 
people. For purposes of comparison, a separate mean family size 
was also calculated for a third small group--the 42 children who were 
tested in Spanish (regardless of whether or not they lived with both 
parents )--and was found to be 7.02 people. 

In regard to family structural characteristics. Exhibit 14 shows 
that over 96 percent of the families in all three program types reported 



The Family Information Form unfortunately has a complicated- 
appearing format and uses somewhat sophisticated language. It is 
quite likely that some information reported on this form may there- 
fore be inaccurate, since responding families may have at times 
misread or misinterpreted items or instructions. Furthermore, no 
reliability check on the reported data was made. 



EXHIBIT 13 - PERCENTAGE OF FAMILY RESPONSE BASED ON SAMPLE CHILDREN 
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that the mother was living at home with the child; the percentage of 
fathers living at home ranged from 69 to 76 percent among the three 
center types. These percentages, of course, take into account only 
families from a portion of the total sample. Exhibit 16 also shows the 
percentage of families reporting fathers living at home, but percentages 
in this exhibit are based on the actual numbers of total and eligible 
sample children in each program type. A comparison of these percent- 
age figures with those given in Exhibit 14 shows that when the total 
sample is considered, the percentage of families with a father living 
at home decreases considerably. 

The data presented in Exhibit 14 also include information on 
families receiving public welfare. Again, figures given are percentages 
and show that from 2 5 to 31 percent of the reporting families in all 
three center types were receiving public welfare aid. 

The level of education completed by parents or guardians (given 
separately for fathers and mothers) of children in S, M, and L centers 
is presented in Exhibit 17. It is interesting that while 165 families 
did not report the father's level of education, only three families did 
not report this information for mothers. For both fathers and mothers, 
an educational level of at least 9 years was most frequently reported. 
Overall percenta^ ^ distributions in both tables show that for the 
majority of all parents reported, the educational level ranged from 7 
to 12 years of school. The highest average incidence of parents re- 
porting no formal education was 3 percent for mothers of S children 
and 2 percent for fathers of M children. On the other hand, the highest 
average incidence of parents who had completed at least some college 
was 12 percent, a figure reported among both mothers and fathers of 
S children. 

Family income data are presented in Exhibit 18, which lists 
percentage figures for families according to program type and level 
of income. The range of possible annual incomes is from less than 
$1,000 to $10,000 or more. As can be seen from the distribution, the 
majority of all families responding reported incomes ranging from 



PRC R-886 
39 



EXHIBIT 15 - MEAN FAMILY SIZE FOR THREE SUBGROUPS OF 

CHILDREN TESTED 



Group Type 


Total 

N 


Mean 

Family Size 


Eligible non- Spanish- speaking children 


517 


6.69 


Eligible non-Spanish- speaking children 






with mother and father 


387 


7.11 


Children tested in Spanish 


42 


7.02 



EXHIBIT 16 - PERCENTAGE OF FAMILIES OF SAMPLE CHILDREN 

WITH FATHER LIVING AT HOME 



Program Durcition 


Total Sample 


Eligible Only 


Short 


49.1 


49.1 


Mcidium 


53.0 


63.4 


Long 


45.3 


61.3 



EXHIBIT 17 - LEVEL OF EDUCATION COMPLETED BY PARENTS OF CHILDREN IN SHORT- 

MEDIUM-, AND LONG-TERM CENTERS 
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0.5 

1.1 

5.3 

8.5 

41.0 

36.7 

6.9 

0.0 

100.0 




1 

2 

10 

16 

77 

69 

13 

0 

188 


o 

<D 

li 

B 

(D 

hi 

U 

0 

0 


Medium 1 




‘ 1.6 
0.0 
7.0 
21.3 
37.7 

27.9 
4.5 
0.0 

99.9 




4 

0 

17 

52 

92 

68 

11 

0 

244 


Short 1 




3.1 

3.6 

8.0 

12.1 

31.3 

28.6 

12.1 

1.3 

100.1 




7 

8 

18 

27 

70 

64 

27 

3 

224 


Father or Male Guardian 


Long 1 




0.0 

2.7 

5.9 

11.2 

24.5 

19.1 
8.5 

28.2 
100.1 


iz; 


0 

5 

11 
21 
46 ; 
36 
16 
53 
188 


1 Medium 1 




2.0 

4.9 

6.1 

14.3 

28.7 
14.3 1 

7.0 

22.5 

99.8 
1 




5 

12 

15 

35 

70 

35 

17 

55 

244 


Short 




1.3 
3.6 

9.4 
12.1 
22.8 

13.4 
12.1 

25.4 
100.1 


2; 


3 

8 

21 
27 
51 
30 
27 
57 ' 
224 


Grade Level of 
Education Completed 


No Formal Education 
Elementary, 1-3 
Elementary, 4-6 
Elementary, 7-8 
High School, 9-11 
High School, 12 
College (some or all) 
Not Reported 
Total^^^ 




Note: (1) Percentages may not equal 100 due to rounding. 



PRC R-886 
41 



EXHIBIT 18 - ANNUAL INCOME OF FAMILIES WITH CHILDREN IN 

SHORT-, MEDIUM-, AND LONG-TERM CENTERS^^ 



Income Level 


Short 


Mec 


ium 


Long ! 


N 


% 


N 


% 


N 


% 


Less than $1,000 


17 


7.6 


21 


8.6 


14 


7.4 


$1,000-$1,999 


32 


14.3 


41 


16.8 


23 


12.2 


$2,000-$2,999 


26 


11.6 


36 


14.8 


30 


16.0 


$3,000-$3,999 


52 


23.2 


51 


20.9 


32 


17.0 


$4,000-$4,999 


51 


22.8 


39 


16.0 


23 


12.2 


$5,000-$5,999 


24 


10.7 


31 


12,7 


37 


19.8 


$6, 000- $7, 999 


14 


6.3 


20 


8.2 


23 


12.2 


$8,000-$9,999 


2 


0.9 


3 


1.2 


4 


2.1 


$10,000 or more 


5 


2.2 


2 


0.8 


2 


1.1 


Not reported 


1 


0.4 


0 


0.0 


0 


0.0 


Total 


224 


100.0 


244 


100.0 


188 


100.0 



Note: (1) Data were obtained from Family Information Forms sent to 
families with children in the 1966 full-year Head Start 
program. 
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$2,000 to $5,000. The most frequently reported income was in the 
$3,000 to $4,000 range. Reported information also shows that, while 
few families listed incomes of $6,000 or more, roughly 8 percent of 
the families in the programs had incomes of less than $1,000, 

B, Tep-t Results 

1, Distributions of Raw Scores ^d Unadjusted Means by 
Short", Medium-, and Long-Term Centers 

Exhibits 19A through 22C present raw score distributions 
on each of the four main tests for all 964 children tested in S, M, and 
L centers. Dropped or non-eligible children have been designated by 
an asterisk. Within a treatment classification, the distributions have 
been classified according to sex, age, and race. 

Exhibits 19A, 19B, and 19C gives the distributions of raw scores 
for the Peabody Picture Vocabulary Test (PPVT), Form B, For this 
test the raw score is the ceiling point minus the number of items wrong. 
Exhibits 20A, 2 OB, and 20C give the distributions of raw scores of 
children for the revised Pre-School Inventory (PSI), Here the raw 
score is the total number of points obtained for the whole test, or the 
sum of the scores of the four subtests. The range of possible scores 
for the PSI total is from 0 to 90. 

Exhibits 21 A, 2 IB, and 21C present the distributions of raw s 
scores for the Behavior Inventory (BI). These are BI total scores, 
obtained by the addition of the total number of points for positive items 
to the total number of points for negative items. The range of possible 
scores is from 50 to 200, 

Exhibits 22A, 22B, and 22C give the distributions of total raw 
scores for the Vineland Social Maturity Scale (VSMS), The total raw 
score on the VSMS is the basal score plus all points for which credit 
was received beyond the basal score. 

The final six rows of Exhibits 19A through 22C indicate the num- 
bers of children (N's), and mean and median raw scores for each sex- 
race-age classification, for both "total” children (N = 964, representing 




EXHIBIT 19A - DISTRIBUTION OF PPVT RAW SCORES FOR CHILDREN IN SHORT-TERM 

CENTERS, BY SEX, RACE, AND AGE 
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EXHIBIT 19A (Continued) 
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EXHIBIT 19B - DISTRIBUTION OF PPVT RAW SCORES FOR CHILDREN IN MEDIUM- 

TERM CENTERS, BY SEX, RACE, AND AGE 
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Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 17 weeks. 



EXHIBIT 19B (Continued) 



PRC R-886 
46 



i 

i) 


Non- White I 


e 

tc 

d 

%) 

o« 

< 


6-6 

6-11 


n- >0 


O o 
in in 
• • 
fM >o >n 
to fO 


o o 
in m 
• • 

r>i >0 >0 


o m 


>o 


1 

61.0C 

61.00 


1 

61.00 

61.00 


5-6 

5-11 


00 >0 0 ^ ^ m 
^ ^ in in 


5 

46.40 

49.00 


o o 

^ o 
• • 


5-0 

5-5 


fOto^ootn^co^fMfo^ 

<M<MtgrsjtOfOfO^^^^ 


o o 
^ o 

• • 

^ to ^ 
e-< to to 


6 

52.33 

30.50 


4-6 

4-11 


<> <> 

inr-ooo^-HrMin^^fom>nio 

iMrMrgr>jtotOtoto^^^^^ 


in o 
Bjq 

»n in in 
^ to to 


pn o 

r- o 
• . 

•** in m 

tn tn 


4-0 

4-5 


<}• 

m >o >o r** 

rM rvj rvj rvj rvj 


5 

25.00 

26.00 


4 

24.75 

25.50 


fO to 


m r- 00 o ^ in 00 

^ ^ (M IM tsi fs4 


7 

20.71 

20.00 


7 

20.71 

20.00 


o m 
to to 




1 

21.00 

21.00 


00*0 

00*0 

0 


White 


Age Range 


O sO 


^ r- o rg IM sO 

IN] in m in >o >o so 


' 7 
55.57 
59.00 


7 

55.57 

59.00 


6-0 

6-5 


(Mtotoinoo— <^h-ooo^o^toto^^r-o*« 
tototototo^^^^^^inminininm 


17 

45.82 

48.00 


17 

45.82 

48.00 


>o ^ 
1 1 
in in 


to(7^^^oininin(^oo^rMrM^intn>oeoto 

^iMtoto^^^^^ininminininininmin^ 


20 

46.50 

50.00 


20 

46.50 

50.00 


o in 
m m 


# ^ ^ # # # # 

^ooooo^to^NONOO^^iM iMtommooooooooooorgrsito^inr- 

OlMlMtOtOtOtOtOtO^^^^^^^^^^^^^lftH^iPIII^II^II^II^II^ 


30 

42.27 

44.00 


23 

40.65 

42.00 


1 1 


^ ^ ^ ^ 
ttooo^oiMvOh* ^m>o^^to^inininvo^^^or*"r-r-ooto^inin>n 


30 

39.73 

45.00 


24 

39.25 

45.00 


4-0 

4-5 


iM m iM o ^ r- to 
rvi rvi to ^ ^ ^ in 


7 

37.14 

40.00 


7 

37.14 

40.00 




>o ^ 
to to 


to 00 

^ ^ to to 


4 

26.75 

28.00 


4 

26.75 

28.00 




o in 

cA cA 


1 


1 1 1 


1 1 1 




fli 

'3 ^ "® 
|zSS 

H 


a 

JS s-S 
5 U 

C.' 



o 




Note; (1) An a«teri«k denote# an ineligible child, who wa« in the program for lea# than 17 week#. 
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EXHIBIT 19C - DISTRIBUTION OF PPVT RAW SCORES FOR CHILDREN 

IN LONG-TERM CENTERS, BY SEX, RACE, AND AGE 
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Age Range 
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2 






Mean 


5.00 


- 
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- 
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46.57 


40.00 
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0,00 


- 


22.00 


41.00 


51.00 


40.00 


- 


- 



Note: (1) An aiterifk denotes an ineligible childi who was in the program for less than 25 weeks. 
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EXHIBIT 19C (Continued) 





Male 




Non- White 




Age Range 




3-0 
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42.00 


38.50 
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12 
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- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for lass than 25 weeks. 
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Female 



White 



Age Range 



3-0 

3-5 



3-6 

3-11 



4-0 

4-5 



4-6 

4-11 



31 

42 



23 

54 



Total 

N 

Mean 

Median 



Eligible 

N 

Mean 

Median 



24 

30 

32 



2 

36.50 

36.50 



2 

38.50 

38.50 
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28.67 

30.00 



2 

36.50 

36.50 
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30.00 



5-0 

5-5 



5-6 

5-11 
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Note: 



(1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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EXHIBIT 19C (Continued) 
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48 
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52 
















43 


52* 
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53* 
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56 
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N 


3 


6 


19 


48 


45 


4 


1 




Mean 


34.00 


26.17 


34.21 


36.90 


38.16 


49.75 


47.00 




Median 


39.00 


28.00 


34.00 


37.50 


37.00 


48.00 


47.00 




Eligible 


















N 


3 


4 


10 


31 


33 


4 


1 




Mean 


34.00 


21.75 


33.90 


37.52 


38.09 


49.75 


47.00 




Median 


39.00 


22.00 


34.00 


37.00 


36.00 


48.00 


47.00 





Note: (1) An aeteriik denotei an ineligible child, who waa in the program for leaa than 25 weeks. 
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20A - DISTRIBUTION OF PSI RAW SCORES FOR CHILDREN IN SHORT-TERM 
CENTERS, BY SEX, RACE, AND AGE 
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Note: (1) An asterisk denotes an ineligible child, who was in the program f' * less than 17 weeks. 
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Note: (i) An asterisk denotes an ineligible child, who was in the program for less than 17 weeks 
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EXHIBIT 20C - DISTRIBUTION OF PSI RAW SCORES FOR CHILDREN 

IN LONG-TERM CENTERS, BY SEX. RACE, AND AGE 





Male 
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Age Range 


3-0 

3-5 


3-6 

3-11 


4.0 

4-5 


4-6 

4-11 


5-0 

5-5 


5-6 

5-11 


6-0 

6-5 


6-6 

6-11 


06*0) 

1 




16 


24* 

21 

30* 

35 

39 

40 
40 

42 

43 
66 
74 


29 

32 

43* 

50 

52 

56* 

67 

69 

69 


47 

72* 


m 




Total 

N 

' Mean 
Median 


1 

6.00 

6.00 


m 


1 

16.00 

16.00 


11 

41.91 

40.00 


9 

51.89 

52.00 


2 

59.50 

59.50 


m 

m 


- 


Eligible 

N 

Mean 

Median 


0 

0.00 

0.00 




1 

16.00 

16.00 


9 

45.22 

40.00 


7 

52.57 

52.00 


1 

47.00 

47.00 


- 


- 



Not*: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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EXHIBIT 20C (Continued) 
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Eligibla 
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1 
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1 
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54.00 



6-0 

6-5 



6*6 

6*11 



Not#: (1) An asterisk denotes an ineligible childg who was in the program for less than 25 weeks 
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EXHIBIT 20C 



(Continued) 
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Not.; (1) An n.Uci.k d.aot.. ,n In.Uglbl. child, ,ho «. la th. pro,ram for 1... than 25 ...k.. 
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Note: ( 1 ) An asterisk denotes an ineligible child, who was in the program for less than 17 weeks. 
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EXHIBIT 21C - DISTRIBUTION OF BI RAW SCORES FOR CHILDREN IN 

LONG-TERM CENTERS, BY SEX, RACE, AND AGE 





Male 




White 






Age Range 




3-0 


3-6 


4-0 
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173 


















176* 










Total 


















N 


1 


- 


1 


11 


9 


2 






Mean 


110.00 


- 


130.00 


147.18 


142.56 


133.50 






Median 


1 10.00 


- 


130.00 


145.00 


147.00 


133.50 


- 


- 


Eligible 


















N 


0 


- 


1 


9 


7 


1 


•M 




Mean 


0.00 


- 


130.00 


147.67 


148.29 


149.00 






Median 


0.00 




130.00 


145.00 


148.00 


149.00 


- 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 





PRC R-886 
64 



(Continued) 



EXHIBIT 2 1C 
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Non- White 






Age Range 


3-0 

3-5 


3-6 

3-11 


4-0 

4-5 


4-6 

4-11 


5-0 

5-5 


5-6 

5-11 


6-0 

6-5 


6-6 

6-11 
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176 


96* 

111 

118* 

124 

12'',* 

127* 

131* 

134 
134* 

135 
144 
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148 
158 
160 
167 


82 

101 

118 

133 

134* 

138 

149 

152* 

153* 

153 

160 

163 

166 

169 

177 


81 

82* 

94* 

94 

96 

100 

110 

112 

115 

115 

116 
116 
119 
122 

123 

124 

125 

126 
128 
130 

132 

133 
135* 

135 
136* 

136 
136 
137* 
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141 
143 
149* 
151* 
151 
153* 
153 
159* 
161 
161* 

163 

164 

164 

165 
168 
168 
170* 
175 
177 
179 
182 
182 
185 
185 


119 
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127* 

128 

128* 

129 

130* 
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132 
135 
138 
140 
142 
144 
1461' 

147 

154 

155 

156 
156 
159* 

163 

163 

163 

167 

169 

172 

176 

179 

182 

185 

* 


86* 

123 

131 

155 


168 




Total 

N 

Mean 

Median 


2 

174.50 

174.50 


16 

135.00 

134.00 


15 

143.20 

152.00 


53 

139.55 

136.00 


31 

149.71 

147.00 


4 

123.75 

127.00 


1 

168.00 

168.00 


- 


Eligible 

N 

Mean 

Median 


1 

176.00 

176.00 


10 

142.80 

145.50 


12 

142.42 

151.00 


42 

139.74 

135.50 


26 

151.96 

154.50 


3 

136.33 

131.00 


1 

168.00 

168.00 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for leas than 25 weeks. 
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EXHIBIT 2 1C (Continued) 





Female 




White 




Age Range 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4- 11 


5-5 


5-11 


6-5 


6-11 
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120 


64 
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122 
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172 
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/ 

i 


















t 
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Total 


















N 


- 


2 


2 


3 


7 


1 


- 


- 


Mean 


- 


127.00 


139.50 


155.00 


135.14 


161.00 




- 


Median 


- 


127.00 


139.50 


162.00 


153.00 


161.00 


- 


- 


Eligible 


















N 


- 


2 


2 


3 


5 


1 


- 


- 


Mean 


- 


127.00 


139.50 


155.00 


139.00 


161.00 


- 


- 


Median 


- 


127.00 


139.50 


162.00 


153.00 


161.00 


- 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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F«m4l« 


Non- White 






Age Range 


3-0 

3-5 


3-6 

3-11 


4-0 

4-5 


4-6 

4-11 


5-0 

5-5 


5-6 

5-11 


6-0 

6-5 


6-6 

6-11 


96 

132 

152 


117 

142*(1) 

144* 

144 

155 

168 


no* 

124 

124 

127 

140* 

141 

142* 

143 

146* 

148 

151 
151* 

152 
154 
157 
158* 
167* 
168* 
173* 


104 

108* 

111 

112 

114 

116 

117 

119 
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121* 

124 

127 

134 

134 

134 

135* 

136* 

136* 

137 

137 

138* 

141 

141* 

143 

144 
148 
150 
153* 
.155* 

155 

156 
159* 
159* 
160 
161 
163* 
164* 
166* 

167 

168 
168 
170 
171* 
177* 
178 
184* 
188 
194 


99 

102 

108 

no 

114 

129 

130 
132* 
134 
137 
140 
140 

140 

141 
147 
147 
150 
151* 
151* 
152 
153* 

155 

156 

156 
157* 

157 
158* 
158* 
158* 
159 
161* 
165 
167 
170 

172 

173 
174* 
174* 
176* 
178 

184 

185 
187 
191 
191 


119 

137 

157 

177 


175 




Total 

N 

M*an 

Median 


3 

126.67 

132.00 


6 

145.00 

144.00 


19 

146.11 

148.00 


48 

145.80 

143.50 


45 

152.64 

156.00 


4 

147.50 

147.00 


1 

175.00 

175.00 


- 


Eligible 

N 

Mean 

Median 


3 

126.67 

132.00 


4 

146 30 
149.50 


10 

142.00 

145.50 


3J 

142.97 

141.00 


33 

149.94 

151.00 


4 

147.50 

147.00 


1 

175.00 

175.00 


- 



Not*: (1) An astariik d«notes an inaligibl* child, who was in th* program for !■■■ than 25 w**ka. 
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EXHIBIT 22A - DISTRIBUTION OF VSMS RAW SCORES FOR CHILDREN IN SHORT-TERM 

CENTERS, BY SEX, RACE, AND AGE 
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EXHIBIT 22A (Continued) 
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EXHIBIT 22B - DISTRIBUTION OF VSMS RAW SCORES FOR CHILDREN IN MEDIUM- 

TERM CENTERS, BY SEX, RACE, AND AGE 
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Note: (1) An asterisk dsnotes an insligible child, who was in the program for less than 17 weeks. 
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Note: (1) An a.teri.k denote, an ineligible child, who was in the program for than 17 weeks 



PRC R-886 
71 



EXHIBIT 22C - DISTRIBUTION OF VSMS RAW SCORES FOR CHILDREN 

IN LONG-TERM CENTERS, BY SEX, RACE, AND AGE 





Male 




White 






Age Range 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 


6-5 


6-11 




40.5*(^> 




54.5 


47.5* 


52.5 


53.0 














52.0 


54.0 


71.0* 














53.5 


57.0 
















54.0 


58.0* 
















55.0 


58.5 
















55.0 


59.0* 
















57.0 


60.0 
















58.5 


62.0 
















60.5 


62.5 
















62.0* 


















63.0 










Total 


















N 


1 


- 


1 


11 


9 


2 






Mean 


40.50 


- 


54.50 


56.18 


58.17 


62.00 






Median 


40.50 


- 


54.50 


55.00 


58.50 


62.00 


- 


- 


Eligible 


















N 


0 


- 


1 


9 


7 


1 






Mean 


0.00 


- 


54.50 


56.50 


58.07 


53.00 






Median 


0.00 




54.50 


55.00 


58.50 


53.00 


- 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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EXHIBIT 22C (Continued) 





Male 




Non- Whit* 








Ago Rang. 




3-0 


3«*6 


4-0 


4-6 


5-0 
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5-11 


6-5 


6-11 




45.5*(^) 


36.5 


45.0 


38.5 


47.0 


55.0* 


63i0 






59.0 


40.0 


49.5 


46.5* 


52.0 


56.5 








45.0 


50.0 


47.0 


52.0 


57.0 










45.0 


50.0* 


48.5 


52.0 


59.0 










45.0* 


50.5 


49.0 


52.5 










46.0* 


52.5* 


49.5 


53.5 












48.5* 


53.5 


49.5 


54.5 












49.0* 


53.5 


50.0 


56.0 












50.5 


55.5 


50.5 


56.0 












51.0 


56.5 


51.0 


56.5 












51.0 


57.5 


51.5* 


56.5 












51.0* 


58.0 


52.0 


56.5* 












52.5 


58.0 


52.5 


57.0 












54.5 


58.0 


52.5 


57.0* 












56.0 


61.0* 


53.0 


58.5 












59.0* 




53.0 


58.5 
















53.0 


58.5 
















53.5 


58.5* 
















54.0 


59.5* 
















55.0 


60.5 
















55.0 


60.5 
















55.5 


61.0 
















56.0 


61.5 
















56.0 


61.5 
















56.0 


62.0* 
















57.0* 


63.0 
















57.5 


63.5 
















57.5 


64.5 
















57.5* 


65.5 
















58.0* 


66.5 
















58.0 

58.0 

58.0 
58.5 

58.5 
59.0* 

59.0 
59.0* 

59.5 

59.5 
59.5* 

60.0 
60.0 
60.0* 
61.0 
61.0 
61.0* 
61.0 
61.0 
62.0 

62.5 
63.0* 
66.0 


70.5 








Total 


















N 

Mean 

Median 


2 

52.25 

52.25 


16 

48.78 

49.75 


15 

53.93 

53.50 


53 

55.87 

57.50 


31 

58.48 

58.50 


4 

56.88 

56.75 


1 

63.00 

63.00 


- 


Eligible 


















N 


1 


10 


12 


42 


26 




1 

63.00 

63.00 




Mean 

Median 

Note* / 1 \ Ar 


59.00 

59.00 

k /4 


48.20 

50.75 


53.79 

54.50 


55.57 

56.00 


58.44 

58.50 


2 

57.50 

57.00 


- 
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EXHIBIT 22C (Continued) 





Female 




White 




Age Range 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 


6-5 


6-11 






48.5 


54.0 


50.5 


52.5*^*^ 


67.5 


•a 


•a 






53.5 


54.0 


55.0 


58.5 
















66.0 


59.5 


















61.0’!' 


















62.0 


















62.0 


















67.5 








Total 


















N 


- 


2 


2 


3 


7 


1 






Mean 


- 


51.00 


54.00 


57.17 


60.43 


67.50 






Median 


- 


51.00 


54.00 


55.00 


61.00 


67.50 


- 


- 


Eligible 


















N 


- 


2 


2 


3 


5 


1 


_ 


,a 


Mean 


- 


51.00 


54.00 


57.17 


61.90 


67.50 




aa 


Median 


- 


51.00 


54.00 


55.00 


62.00 


67.50 


- 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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EXHIBIT 22C 



(Continued) 





Female 




Non- White 








Age Range 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 


6-5 


6-11 




49.0 


42.5 


46.0 


45.5* 


46.5 


58.5 


67.0 






51.5 


46.5 


46.0* 


48.5 


47.5 


59.0 








58.0 


52.5*(^> 


46.5* 


48.5 


50.5* 


60.0 










52.5 


51.0* 


51.5 


51.5 


62.0 










56.0* 


51.5 


52.5 


52.0 












61.5 


51. 5* 


53.5 


54.0* 














52.5 


53.5 


55.0* 














53.0 


54.0 


55.5 














54.0 


54.0 


56.0* 














54.0 


54.5 


56.5* 














54.0* 


55.0 


56.5* 














55.0* 


55.5 


56.5 














55.5 


55.5 


57.0 














56.5* 


55.5 


58.0 














58.0* 


55.5* 


58.0 














58.0 


55.5* 


58.0 














58.0 


55.5 


58.5 














58.0* 


56.5 


58.5 














59.0 


56.5 


58.5 
















56.5 


58.5 
















56.5* 


59.0 
















56.5* 


59.5 
















56.5* 


59.5 
















57.0 


59.5 
















57.0 


59.5 
















57.0* 


59.5 
















57.5* 


60.0 
















57.5* 


60.0 
















57.5 


60.0 
















58.0 


60.0 
















58.0 


60.0 
















58.0 


60.0 
















58.5* 


60.5 
















58.5* 


61.0 
















58.5 


61.5 
















59.0 


61.5 
















59.0 


61.5* 
















60.0 


62.0 
















60.'.* 


62.0 
















60.5* 


62.0* 
















60.5* 


63.0 
















61.0 


65.5* 
















62.0* 


67.0* 
















63.0 


69.0* 
















64.0 


69.5* 
















65.0 


















65.0* 


















68.0 










Total 


















N 


3 


6 


19 


48 


45 


4 


1 


- 


Mean 


52.83 


51.92 


53.58 


57.15 


58.79 


59.88 


67.00 


- 


Median 


51.50 


52.50 


54.00 


57.00 


59.50 


59.50 


67.00 


- 


Eligible 


















N 


3 


4 


10 


31 


33 


4 


1 


- 


Mean 


52.83 


50.75 


54.15 


56.65 


58.47 


59.88 


67.00 


- 


Median 


51.50 


49.50 


54.00 


56.50 


59.50 


59.50 


67.00 


- 



Note: (1) An asterisk denotes an ineligible child, who was in the program for less than 25 weeks. 
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the total number of children tested and included in PRC's data tabula- 
tions and analyses) and ‘'eligible -only" children (N = 831). Scores of 
children tested in Spanish are not separately identified in the arrays. 

Exhibits 23 through 26 present sumrnary tables of unadjusted mean 
raw scores on the PPVT, PSI, BI, and VSMS for the subset of eligible 
children whose tests were administered in Spanish. 

For the PPVT, PSI, BI, and VSMS, Exhibits 27 through 30 pre- 
sent, for S, M, and L eligible children only (N = 831), summaries of 
unadjusted raw score means, standard errors of those means, medians, 
and appropriate N's. These summary tables were prepared from the 
unit raw score distributions in Exhibits 19A through 22C, Medians 
were determined only when the sample size in a subclassification was 
less than 30, For the VSMS and the BI, standard errors of the means 
are presented for S, M, and L total means only. 

Exhibits 31 and 32 present PSI subtest raw score unadjusted 
means for (1) all children, by age and race (Exhibit 31), and (2) eligible 
children only, by age and race (Exhibit 32), Exhibits 33 and 34 show 
the percentages of children answering the item correctly for each sub- 
test for the same groups. These percentages indicate the relative 
difficulty of the different subtests. For example, in Exhibits 33 and 34 
it is clear that subtest 2 was generally more difficult for children at 
each age level, regardless of race. 

Subtest names^ are: 

• Subtest 1: Personal -Social Responsiveness 

• Subtest 2: Associative Vocabulary 

• Subtest 3: Numerical Concept Activation 

• Subtest 4: Sensory Concept Activation 

Finally, an item difficulty breakdown for the PSI subtests is 
given in Exhibit 35, Here, the percentage of eligible children (for 




Test names are those provided by the authors of the tests. See 
Caldwell, Bettye and Soule, Donald, The Pre-School Inventory 
(Unpublished Head Start Study, OEO Contract No, 514). 



EXHIBIT 23 - PPVT SUMMARY OF RAW SCORE MEAlSiS' ' AND SAMPLE SIZES FOR SHORT - 

MEDIUM-, AND LONG-TERM ELIGIBLE CHILDREN, BY LANGUAGE USED IN 
TESTING, RACE, AND AGE 
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f\j o 

fvj p-H f\J 


0 ^ m o 

fvj O 

p™< p— < p^ 


in rsj O 
O 00 

p— < p— < 


00 >-< fM 
nO O 


in f\j 

f\j p-< 

m f\j f\j 


Total, 

Spanish 

and 

English 


28.82 

20.94 

28.25 


36.19 

37.82 

36.21 


42.65 

43.72 

41.25 


43.76 

50.44 

49.50 


39.21 

41.53 

37.49 


Z 


f\J 0 ^ o 

f\J p-H fsj 


vO O vO 
rj O O 

p™< p— < p— < 


.-H 00 


in 00 rsj 
fvj in 


O 00 nO 
00 o 

fVJ fVJ fVJ 


Total, 
Tested in 
English 


28.82 

20.94 

28.25 


36.48 
38.25 
36.42 j 


42.72 

43.78 

41.81 

- 


44.40 ' 

50.53 

49.50 


38.75 

41.65 

37.95 


Z 


in T}< 00 

i-H i-H 


fvj in LO 
o m 0 ^ 


O O vO 
nO fVj nO 


00 fVJ 


ID m ^ 
0^ 00 
p™< p™< 


Non- White, 
Tested in 
English 


28.67 

19.50 

27.28 


36.31 

32.85 

36.58 
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in 


vO 
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Note; (1) Unadjusted means. 
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Note; (1) Unadjusted means. 



EXHIBIT 25 - BI SUMMARY OF RAW SCORE MEANS' * AND SAMPLE SIZES FOR SHORT- 
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ERIC 



Note: (1) Unadjusted means. 



EXHIBIT 26 - VSMS SUMMARY OF RAW SCORE MEANS' ' AND SAMPLE SIZES FOR SHORT- 

MEDIUM-, AND LONG-TERM ELIGIBLE CHILDREN. BY LANGUAGE USED IN 
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Note: (1) Unadjusted means. 



EXHIBIT 28 - PSI SUMMARY OF UNADJUSTED RAW SCORE MEANS, STANDARD ERRORS OF 

MEANS, MEDIANS, AND SAMPLE SIZES FOR ELIGIBLE CHILDREN IN SHORT- 
MEDIUM-, AND LONG-TERM CENTERS 
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Notes: (1) Standard error of the mean* computed only for total, all ages. 

(2) The median was not obtained when N was more than 30. 

(3) Not computed. 



EXHIBIT 31 - PSI SUBTEST MEAN' ' RAW SCORES FOR ALL CHILDREN, BY AGE AND RACE 
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EXHIBIT 35 - PERCENTAGE OF ELIGIBLE CHILDREN PASSING PSI 

ITEMS, BY AGE AND PROGRAM TYPE 
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69.9 


70.6 


61.0 


72.1 


73.7 


66 


.7 


9. 


96.0 


95.7 


66.7 


92.2 


94.3 


96.3 


98.1 


96.3 


97.4 


97.1 


100.0 


100 


.0 


10. 


72.0 


73.9 


70.8 


78.9 


86.8 


81.3 


84.5 


85.3 


89.6 


92.7 


89.5 


66 


.7 


11. 


36.0 


47.8 


54.2 


64.8 


63.2 


69.2 


67.0 


78.0 


88.3 


75.0 


86.0 


100 


.0 


12. 


28.0 


17.4 


20.8 


39.1 


49.1 


42.1 


59.2 


64.2 


54.6 


50.0 


71.9 


66 


.7 


13. 


96.0 


95.7 


95.8 


96.9 


94.3 


97.2 


98.1 


97.3 


94.8 


98.5 


100.0 


100 


.0 


14. 


48.0 


26.1 


37.5 


63.3 


57.6 


42.1 


68.0 


72.5 


58.4 


86.8 


87.7 


66 


.7 


15. 


44.0 


47.8 


50.0 


61.7 


56.6 


68.2 


72.8 


69.7 


74.0 


61.8 


87.7 


100 


.0 


16 . 


40.0 


34.8 


50.0 


55.5 


54.7 


54.2 


68.9 


60.6 


74.0 


52.9 


84.2 


66, 


.7 


17. 


96.0 


65.2 


75.0 


85.9 


79.3 


85.1 


91.3 


91.7 


92.2 


91.2 


96.5 


100, 


.0 


18* 


92.0 


60.9 


91.7 


87.5 


89.6 


96.3 


93.2 


93.6 


94.8 


95.6 


96.5 


100, 


.0 


19. 


20.0 


30.4 


16.7 


47.7 


50.0 


45.8 


63.1 


68.8 


53.3 


73.5 


82.5 


66. 


.7 


20. 


00.0 


4.4 


12.5 


21.1 


21.7 


23.4 


24.3 


38.5 


29.9 


50.0 


61.4 


33. 


.3 


21. 


32.0 


30.4 


20.8 


35.9 


40.6 


27.1 


45.6 


59.6 


35.1 


60.3 


59.6 


33. 


.3 


22. 


8.0 


34.8 


12.5 


18.0 


16.0 


15.0 


20.4 


21.1 


22.1 


25.0 


35.1 


0. 


,0 


23. 


4.0 


17.4 


8.3 


18.0 


28.3 


24.3 


47.6 


44.0 


31.2 


58.8 


70.2 


66. 


.7 


24. 


52.0 


26.1 


20.8 


54.7 


66.0 


54.2 


75.7 


83.5 


79.2 


79.4 


89.5 


100. 


,0 


25. 


4.0 


8.7 


4.2 


14.8 


23.6 


19.6 


30.1 


43.1 


27.3 


54.4 


61.4 


33. 


,3 


26. 


36.0 


47.8 


25.0 


43.8 


72.6 


49.5 


65.1 


78.0 


59.7 


83.8 


84.2 


66. 


,7 


Num- 
ber of 
Chil- 
dren 


25 


23 


24 


128 


106 


107 


103 


109 


77 


68 


57 


3 
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EXHIBIT 35 (Continued) 



PSI SUBTEST 2 



Item 

No. 




3 






4 


5 


6 


S 


M 


L 


S 


M 


L 


S 


M 


L 


S 


M 


L 


27. 




4.0 


4.4 


0.0 


5.5 


15.1 


6.5 


19.4 


21.1 


14.3 


25.0 


35.1 


33 


.3 


2 8 • 




0.0 


4.4 


0.0 


5.5 


12.3 


5.6 


13.6 


22.0 


10.4 


17.7 


45.6 


0 


.0 


2 9 • 




4.0 


4.4 


8.3 


11.7 


14.2 


12.2 


28.2 


31.2 


9.1 


38.2 


57.9 


33 


.3 


30. 




0.0 


8.7 


16.7 


18.0 


17.0 


30.8 


22.3 


22.0 


37.7 


14.7 


29.8 


66 


.7 


31 . 




4.0 


4.4 


8.3 


12.5 


15.1 


15.9 


35.0 


25.7 


27.3 


25.0 


49.1 


0 


.0 


32. 




28.0 


13.0 


20.8 


26.6 


29.3 


29.0 


61.2 


40.4 


50.7 


50.0 


79.0 


66 


.7 


33. 




12.0 


4.4 


8.3 


15.6 


22.6 


17.8 


31.1 


35.8 


28.6 


27.9 


45.6 


0 


.0 


34. 




44.0 


30.4 


25.0 


49.2 


40.6 


48.6 


53.4 


63.3 


62.3 


51.5 


70.2 


33 


.3 


35. 




0.0 


0.0 


4.2 


4.7 


13.2 


7.5 


11.7 


17.4 


23.4 


17.7 


24.6 


0 


.0 


36. 




4.0 


0.0 


0.0 


3.9 


10.4 


5.6 


8.7 


13.8 


9.1 


14.7 


22.8 


33 


.3 


37. 




0.0 


0.0 


4.2 


3.1 


4.7 


6.5 


5.8 


10.1 


10.4 


2.9 


33.3 


33, 


.3 


38 . 




16.0 


21.7 


3.3 


36.7 


37.7 


50.5 


58.3 


53.2 


64.9 


61.8 


77.2 


66, 


.7 


39. 




32.0 


30.4 


25.0 


64.8 


63.2 


52.3 


70.9 


69.7 


72.7 


69.1 


87.7 


66, 


.7 


40. 




56.0 


43.5 


54.2 


74.2 


62.3 


56. 1 


71.8 


70.6 


62.3 


76.5 


87.7 


100. 


.0 


41. 




56.0 


52.2 


50.0 


59.4 


55.7 


69.2 


72.8 


58.7 


75.3 


57.4 


79.0 


66. 


.7 


42. 


-,(1) 


32.0 


52.2 


41.7 


57.0 


62.3 


55.1 


77.7 


64.2 


70.1 


60.3 


82.5 


100. 


.0 


43. 


2' ' 


0.0 


8.7 


4.2 


8.6 


22.6 


14.0 


20.4 


21.1 


19.1 


13.2 


21.1 


0. 


.0 


1 


16.0 


17.4 


16.7 


35.9 


35.8 


41.1 


38.8 


48.6 


58.8 


45.6 


54.4 


66. 


,7 




T 


16.0 


26.1 


20.9 


44.5 


58.4 


55.1 


59.2 


69.7 


77.9 


58.8 


75.5 


66. 


,7 


44. 


2 


0.0 


4.3 


0.0 


8.6 


11.3 


5.6 


9.7 


7.3 


19.1 


17.6 


15.8 


0. 


,0 


1 


60.0 


65.2 


62.5 


61.7 


62.3 


70.1 


70.9 


65.1 


69.1 


66 .2 


75.4 


66. 


.7 




T 


60.0 


69.5 


62.5 


70.3 


73.6 


75.7 


80.6 


72.4 


88.2 


83.8 


91.2 


66. 


,7 


45. 


2 


0.0 


0.0 


0.0 


9.4 


15.1 


14.0 


20.4 


16.5 


17.6 


33.8 


17.5 


0. 


,0 


1 


4.0 


26.1 


41.7 


44.5 


43.4 


37.4 


51.5 


48.6 


47.1 


47.1 


61.4 


33. 


,3 




T 


4.0 


26.1 


41.7 


53.9 


58.5 


51.4 


71.9 


65.1 


64.7 


80.9 


78.9 


33. 


3 


46. 


2 


8.0 


0.0 


0.0 


14.8 


7.5 


6.5 


19.4 


9.2 


11.8 


41.2 


14.0 


0. 


0 


1 


56.0 


34.8 


41.7 


50.8 


64.2 


53.3 


55.3 


67.9 


69.1 


32.4 


64.9 


66. 


7 




T 


64.0 


34.8 


41.7 


65 .6 


71.7 


59.8 


74.7 


77.1 


80.9 


73.6 


78.9 


66. 


7 


47. 


2 


4.0 


0.0 


4.2 


8.6 


10.4 


6.5 


11.7 


11.0 


14.7 


13.2 


21.2 


33. 


3 


1 


56.0 


56.5 


75.0 


78.9 


73.6 


77.6 


75.7 


82.6 


86.8 


77.9 


77.2 


33. 


3 




T 


60.0 


56.5 


79.2 


87.5 


84.0 


84.1 


87.4 


93.6 


101.5 


91.1 


98.4 


66. 


6 


Num- 
ber of 
Chil- 
dren 


25 


23 


24 


128 


106 


107 


103 


109 


77 


68 


57 


3 



Note: (1) For items 43 through 47, 2 represents the percentage of chil- 
dren getting 2 points, 1 represents the percentage of children 
getting 1 point, and T represents the percentage of children 
getting 1 or 2 points. 
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EXHIBIT 35 (Continued) 



PSI SUBTEST 3 



Item 

No. 




3 






4 






5 






6 




S 


M 


L 


S 


M 


L 


S 


M 


L 


S 


M 


L, 


48. 


44.0 


73.9 


70.8 


89.1 


80.2 


83.2 


95.2 


92.7 


92.2 


97.1 


96.5 


100.0 


49. 


60.0 


69.6 


66.7 


70.3 


70.8 


69.2 


75.7 


84.4 


71.4 


88.2 


89.5 


66.7 


50. 


44.0 


39.1 


58.3 


67.2 


62.3 


54.2 


76.7 


78.9 


70.1 


82.4 


91.2 


100.0 


51. 


0.0 


8.7 


0.0 


3.1 


1.9 


1.9 


7.8 


6.4 


3.9 


25.0 


7.0 


33.3 


52. 


24.0 


17.4 


8.3 


21.9 


27.4 


29.0 


42.7 


45.0 


39.0 


69.1 


70.2 


66.7 


53. 


36.0 


21.7 


37.5 


60.9 


57.6 


49.5 


74.8 


71.6 


57.1 


79.4 


80.7 


100.0 


54. 


8.0 


8.7 


12.5 


26.6 


27.4 


25.2 


46.6 


48.6 


28.6 


54.4 


68.4 


100.0 


55. 


24.0 


0.0 


20.8 


26.6 


27.4 


16.8 


29.1 


41.3 


9.1 


26.5 


54.4 


0.0 


56. 


0.0 


8.7 


8.3 


5.5 


10.4 


9.4 


13.6 


17.4 


6.5 


13.2 


15.8 


0.0 


57. 


68.0 


43.5 


50.0 


71.1 


66.0 


77.6 


83.5 


80.7 


92.2 


94.1 


91.2 


100.0 


58. 


16.0 


4.4 


16.7 


25.0 


25.5 


22.4 


37.9 


33.0 


29.9 


42.7 


59.7 


66.7 


59. 


64.0 


69.6 


66.7 


73.4 


81.1 


74.8 


86.4 


89.9 


80.5 


86.8 


86.0 


100.0 


60. 


4.0 


4.4 


0.0 


5.5 


7.6 


13.1 


10.7 


27.5 


15.6 


11.8 


33.3 


0.0 


61. 


28.0 


56.5 


37.5 


34.4 


45.3 


32.7 


38.8 


44.0 


42.9 


52.9 


54.4 


66.7 


62. 


8.0 


13.0 


16.7 


43.8 


41.5 


41.1 


58.3 


66.1 


62.3 


66.2 


87.7 


66.7 


63. 


20.0 


13.0 


33.3 


34.4 


34.9 


44.9 


49.5 


63.3 


54.6 


70.6 


86.0 


66.7 


64. 


8.0 


8.7 


8.3 


21.1 


24.5 


24.3 


35.9 


50.5 


40.3 


61.8 


82.5 


33.3 


65. 


0.0 


8.7 


0.0 


4.7 


15.1 


9.4 


16.5 


21.1 


22.1 


44.1 


38.6 


0.0 


66. 


8.0 


4.4 


8.3 


3.9 


8.5 


15.0 


18.5 


15.6 


22.1 


29.4 


29.8 


0.0 


Num- 
ber of 
Chil- 
dren 


25 


23 


24 


128 


106 


107 


103 


109 


77 


68 


57 


3 
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EXHIBIT 35 (Continued) 



PSI SUBTEST 4 



Item 




3 






4 






5 






6 




No. 


S 


M 


L 


S 


M 


L 


S 


M 


L 


S 


M 


L 


67. 


92.0 


73.9 


91.7 


89.8 


94.3 


96.3 


94.2 


96.3 


92.2 


100.0 


100.0 


66.7 


68. 


72.0 


56.5 


70.8 


82.0 


90.6 


93.5 


95.2 


96.3 


93.5 


97.1 


100.0 


100.0 


69. 


16.0 


13.0 


20.8 


39.1 


49.1 


52.3 


63.1 


77.1 


75.3 


92.7 


91.2 


100.0 


70. 


4.0 


0.0 


8.3 


23.4 


26.4 


30.8 


42.7 


52.3 


49.4 


73.5 


77.2 


66.7 


71. 


76.0 


60.9 


83.3 


84.4 


89.6 


84.1 


94.2 


89.9 


93.5 


94.1 


89.5 


100.0 


72. 


48.0 


34.8 


66.7 


53.9 


56.6 


53.3 


65.1 


69.7 


59.7 


77.9 


66.7 


100.0 


73. 


68.0 


43.5 


54.2 


68.8 


67.0 


59.8 


82.5 


77.1 


71.4 


88.2 


89.5 


66.7 


74. 


40.0 


60.9 


41.7 


50.0 


60.4 


57.9 


68.0 


62.4 


61.0 


82.4 


82.5 


33.3 


75. 


52.0 


43.5 


41.7 


74.2 


75.5 


72.9 


86.4 


85.3 


75.3 


94.1 


89.5 


100.0 


76. 


20.0 


39.1 


45.8 


50.8 


50.0 


43.9 


54.4 


59.6 


63.6 


55.9 


64.9 


33.3 


77. 


40.0 


39.1 


33.3 


60.9 


52.8 


58.9 


76.7 


77.1 


74.0 


86.8 


73.7 


100.0 


78. 


56.0 


47.8 


54.2 


62.5 


49.1 


56.1 


66.0 


63.3 


54.6 


67.7 


70.2 


100.0 


79. 


52.0 


47.8 


50.0 


64.8 


74.5 


69.2 


87.4 


81.7 


77.9 


86.8 


89.5 


100.0 


80. 


48.0 


47.8 


62.5 


64.1 


66.0 


67.3 


84.5 


84.4 


77.9 


85.3 


96.5 


100.0 


81. 


20.0 


13.0 


25.0 


24.2 


34.0 


25.2 


38.8 


45.9 


27.3 


57.4 


64.9 


33.3 


82. 


20.0 


30.4 


54.2 


47.7 


57.6 


53.3 


68.0 


67.9 


61.0 


70.6 


68.4 


100.0 


83. 


24.0 


43.5 


50.0 


57.0 


61.3 


57.0 


71.8 


76.2 


70.1 


88.2 


84.2 


100.0 


84. 


20.0 


13.0 


8.3 


36.7 


37.7 


36.5 


55.3 


56.0 


48.1 


69.1 


73.7 


66.7 


85. 


52.0 


34.8 


41.7 


53.1 


65.1 


57.9 


72.8 


83.5 


75.3 


86.8 


87.7 


66.7 


Num- 
ber of 
Chil- 
dren 


25 


23 


24 


128 


106 


107 


103 


109 


77 


68 


57 


3 
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each age level and program type) who patied the item ii lifted. For 
items 43 through 47 in subtest 2, the percentage of children receiving 
2 points or 1 point is shown separately, plus the percentage (T) of 
children receiving either 1 or 2 but not 0. 

Comparisons within a column--that is, between items--are based 
on responses of the same sample of children. Comparisons across 
columns are comparisons between groups. The reader should thus 
take into account changes in denominators. 

2. Statistical Analyses 

This subsection will present the results of covariance and 
regression analyses performed on the data. While a large number of 
analyses and results are given, the reader will note that they are simply 
variations of two basic methods of analysis with two types of observa- 
tion.^ Thus, there are: 

a. Analyses based on the CDC as the unit of observation: 

(1) Multivariate analysis of variance with multiple 
covariance adjustments.^' 

(2) Multivariate multiple regression analysis.^ 

b. Analyses based on the individual child as the unit of 

observation: 

(1) Multivariate analysis of covariance.^ 

(2) Univariate stepwise regression analysis.^ 

For convenience, the first will be referred to as "By- Center" analyses, 
and the second as "By-Child" analyses. 



See the detailed discussion of statistical models in Appendix B. 

2 

Cooley, William W. and Paul R. lohnes. Multivariate Procedures 
for the Behavioral Sciences (Chapter 4). New York,' Wiley and 
' Sons, Inc., 1962 

3 . ^ 

Anderson, T.W., Introduction to Multivariate Statistical Anal ysis 
(Section 8.2). New York: Wiley and Sons, Inc., 1958 

4 

Cooley, loc. cit. 

5 

Intertest correlations were eventually calculated for one of these 
analyses, thus making it in effect multivariate. 
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«>• By- Center Analyses 

(1) Analyaes of C ovariance with All Elicribl« 
Children Tested in £ngligh i ® 

The first covariance analysis mace used the 
following covariates and measures for each of 69 CDC's: 

• Age: the average age in months of the center's sample of 
eligible children. 

• Sex: the proportion of males in the center's sample. 

• Race: the proportion of non-whites in the center's sample. 

• Population (Community Size): the log of the population of 
the CDC's town. 

• Poverty: the proportion of children defined as poor in the 
center's sample. 

The classification of a child as "poor" or "not poor" was made 
by entering in Exhibit 36 the income and household size data from the 
Family Information Form fUled out by parents. ^ 



CDC's°''^Hotelt*r ^ It contained 831 children from 71 

C s. However, 67 of these cnilaren were tested in Spanish. In 

covariance a^lyses. language of testing was included as a 

*'°vever. that because of the very badly 
S*^ M Pf.oPortion of children tested in Spanish across 

weM’df^nrLo*«*“’^*<.’ vere being made in means that 

rnch ai f ^ for other covariates 

SroLed Spanish-speaking children were 

” *** sample for subsequent By-Center analyses. A By- 
Child stepwise regression analysis was made for this subgroup; the 
results of that analysis are reported in subsection IV. B. 2. b. (2). (For 

te* C:o°ch^an‘*W ‘V of selection and use of covariates 

°f Covariance: Its Nature and Uses," 
^®P‘®“^®' 1957, pp. 261-281.) Unless otherwise indicated, 
all analyses reported are based on the children tested in English. 

2 _, 

The poverty index used here is not identical with OEO's index, since 
the mcome increments on CAP-HS Form 46 or 46a (dated 20 June 1966) 
are different from those required by the OEO poverty scale. In addi- 
tlon no attempt was made to distinguish between farm and non-farm 
child'!” *i°r regional adjustments. If all the center's eUgible 
WM^.ncomo'l^t ” I*® because the Family Information Form 

dreVwa.^Md *® Proportion of classifiable eUgible chil- 
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EXHIBIT 36 - POVERTY AS A FUNCTION OF HOUSEHOLD SIZE AND 

INCOME 



Household 

Size 


Income 


$1,000 

to 

1,999 


$2,000 

to 

2,999 


$3,000 

to 

3,999 


$4,000 

to 

4.999 


$5,000 

or 

Over 


1 


Poor 




NP 


NP 


NP 


2 


Poor 


NP 


NP 


NP 


NP 


3 


Poor 


Poor 


NP 


NP 


NP 


4 


Poor 


Poor 


NP 


NP 


NP 


5 


Poor 


Poor 


Poor 


NP 


NP 


6 


Poor 


Poor 


Poor 


NP 


NP 


7 


Poor 


Poor 


Poor 


Poor 


NP 


8 or more 


Poor 


Poor 


Poor 


Poor 


NP 



Note: (1) NP denotes "not poor." 



The dependent variable is a 4- variate observation of the form: 



where 



Center average 
„ _ Center average 

“ Center average 
Center average 

j = S. M. L 

i — Ij 3| •••} n. 

ng = 22 

“m 

n_ =21 

^ A 



PPVT raw score 
PSI total raw score 
BI total raw score 
VSMS raw score 



An adjusted mean for measure k (e.g., PPVT) for con 

dition j (S, M, or L) is obtained by: 




^jk " [^1 ^^ij " ^ 1 . ^ ^2 ^^2j 

+ ’’c )] 







• • • 
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where = the unweighted mean of meaauring k in treatment level j 

X . s the unweighted average of control variable X for treat- 
ment level j ^ 

^c. “ tot&l unweighted average of c over all conditions 

= the beta coeificient for control variable c 

Exhibit 37 shows obtained and estimated parameters of the mf 
sures or dependent variables for the total sample of 69 CDC's. 

EXHIBIT 37 - TOTAL SAMPLE PARAMETERS FOR DEPENDENT 

VARIABLES 



Dependent 

Variable 


Total Sample 
Mean 


Sample 

Standard 

Deviation 


Estimate 
of Center 

Standard Deviation 


PPVT 


39.63 


5.99 


4.39 


I SI 


46.7 5 


8.65 


5.22 


BI 


143.33 


8.95 


8.80 


VSMS 


55.28 


4.77 


4.57 



Exhibit 38 lists the total unweighted mean and standard deviation 
of each covariate. The reader should recall that these are based on the 
distribution of center parameters in each case. 



EXHIBIT 38 - COVARIATE MEANS AND STANDARD DEVIATIONS 



Covariate 


Total Sample 
Mean 


Standard 

Deviation 


Age in months 


60.00 


6.49 


Percentage of males 


51.90 


16.78 


Percentage of non-white children 


56.73 


33.81 


Log of population of town 


5.27 


1.04 


Percentage of poor 


39.05 


25.21 



PRC R-886 
96 



Exhibit 39 shows the unadjusted and adjusted means for each 

test for S, M, and L centers. It also shows the total increment added 

to or subtracted from each unadjusted meani along with the components 

of the total increment provided by each control variable or covariate. 

For example, the unweighted average PPVT center average raw score 

for the S centers was 38.64. The adjusted mean was 39.03, obtained 

by addition of the algebraic sum of the components to the unweighted 

mean (the component signs shown are the resultants of - b (X . - X 

L ^ ’ 

Thus, 38.64 plus 0.39 equals 39.03; 41.73 minus 2.16 equals 39.57; etc. 

The test criterion for the multivariate covariance analysis is the 
Wilks' Lambda, A, which is like the F-ratio of a univariate analysis 
of variance. In this analysis, A equals 0.8511. A transformation 
of this yields: 



^8,116 = 0-05 



Thus, the analysis does not support the hypothesis that there are 
significant differences between the means of the S, M, and L centers 
for the 4 -variate dependent variable.^ 

The F-ratio for each measure taken individually was calculated. 
These were: 



Test 


^2,61 


Significance 


PPVT 


0.38 


p >0.05 


PSI 


0.04 


p >0.05 


BI 


3.47 


p<0.05 


VS MS 


1.20 


p >0.05 



The overall sample was designed to detect differences between PPVT 
raw score means of six to seven points with a power of about 0.70. In 
fact, the analysis permitted sufficient reduction of error variance to 
detect differences of five points with a power of about 0.80. It may be 
noted here also that there was no significant difference in statistical 
outcome when the same type of analysis included Spanish-tested eligible 
and included language as a covariate. 
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0.1257 
0.0731 , 

-0.2222 


0.0780 

0.0454 

-0.1380 


0.1431 

0.0833 

-0.2529 


0.1519 

-0.0878 

0.2667 


I ^ 

Population 
I Increment 


-0.0365 

-0.1459 

0.2165 i 

■1 


-0.0990 
0.3958 ; 
0.5871 


1 0.0416 

0.1665 
-0.2470 1 


O O X 

X csj oc 

CO in cr 

^ r- in 

« « • 

O (NJ 

1 


^ Race 

1 Increment 


0.3084 

-1.0886 

1.0247 


0.1679 

-0.5926 

0.5578 


00 CO nO 
^ M in 
m M 
O 

• • • 

o o o 
1 1 


0.6928 
-2.4455 
2.3019 1 


1g 

V 

X £ 
(P 

in u 
u 

a 


-0.0361 

0.0121 

0.0229 


-0.2064 

0.0690 

0.1308 


-0.0752 
0.0252 S 
0.0477 


0.1265 : 

-0.0423 
-0.0801 


[ Age 

[ Increment 


0.0274 
-1.0138 I 
1.2275 


0.0494 

-1.8267 

2.2118 


0.0168 
-0.6226 ! 
0.7538 


0.0110 

-0.4082 

0.4942 


Total 

Increment ' 


0.39 

-2.16 

2.27 


fsj o^ m 

O ro 

s • s 

O csj m 

t 1 


1 0.09 

I -0.22 

0.19 


<M ^ vO 

^ fsj ro 

• s s 

sM <3 

1 


Adjusted 
Mean i 

i 


po ^ 

o in pp> 

s s s 

O 

ro CO ^ 


46.48 
46.80 
46.96 j 


55.58 
53.94 
56.63 ! 


145.34 ; 
138.74 
146.90 i 


Unweighted 

Mean 


38.64 

41.73 

38.07 


46.50 

49.49 

43.61 


55.49 

54.16 

56.44 


144.22 

139.98 

146.54 


Program 

Duration 


Short 

Medium 

Ijong 


Short 

Medium 

Long 


Short 

Medium 

Long 


Short 

Medium 

Long 


Test 


PPVT 


PSI 


VSMS 


IH 



o 

ERIC 
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Thus, there waa a significant difference in means for the Behavior 
Inventory, by the criterion of a = 0.05 , Examination of the adjusted 
means of the BI (see Exhibit 39 above) suggests that the mean of the M 
centers was significantly different from the means of S and L sample 
centers. The t- values for these two comparisons are significant at the 
5 percent level (the t- value of the S-L contrast, of course, is not signif- 
icant at that level). While the criterion a = 0,05 is probably too liberal 
for the F-ratios, since we are making multiple, correlated comparisons, 
it is clear that the 5 to 6 percent difference from upper to lower BI 
means is unique among the four measures. 

It is apparent in Exhibit 39 that for the individual tests (PPVT 
and PSI), the variables that generally contribute most to the adjustment 
of the means are race, population, and age. Poverty appears more 
related to PPVT than to PSI scores. With the Vineland Social Maturity 
Scale, poverty and population appear to take on more weight, and with 
the Behavior Inventory, the population and race components seem 
predominant. 

(2) Multivariate Multiple Regression Analysis 

To examine the relative contribution of differ- 
ent variables to the prediction of center average scores, PRC per- 
formed a multivariate multiple regression analysis. In this analysis, 
the treatment variable was included as a continuous variable, deter- 
mined for a given center by dividing the sum of the number of weeks 
of enrollment in the program of aJU. children in the sample by the num- 
ber of children in the sample. 

The model for this analysis was of the form: 

A 

? = a+pjXj +P2X2 + ... 

The independent variable. were the .ame a. tho.e 

used in the By- Center covariance analysis, plus the exposure variable 
defined above and measured in weeks. 



o 

ERIC 
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Exhibit 40 shows, for each measure, the estimates of the constant 
term a , the |3 coefficients, and the standard deviation of the coeffi- 
cient. The unweighted center average for each test is listed for 
convenience. 

A question of immediate interest is whether or not the (3 coeffi- 
cient for a given variable is really different from zero. To measure 
this probability, the 95 percent confidence limits for each p were 
calculated,^ Exhibit 41 shows for each measure those independent 
variables whose weight ( p ) had a confidence interval that did not 
include zero. 

The percentage of total variance for each test attributed to the 
six independent variables taken together is approximately: 



The PPVT was used to make some checks on the adequacy of the 

model; in particular, examinations of a possible non-linear effect of 

age and the general size of a sex-race interaction were desirable. 

These factors were investigated using the By-Center data calculated 

2 

from the total sample; including the children tested in Spanish, A 
calculation was first made of the percentage of total variance of the 

The reader can readily calculate this interval for himself from the 
parameters given in Exhibit 40, 

2 

In many respects, parameters of the two samples (Total Sample of 
Children Tested in English and Total Sample of Children Tested) were 
very similar. Means and variances of the independent variables other 
than language of testing were similar, Wilk's Lambda for the co- 
variance analysis was about the same and not significant in either case 
The total percentage of dependent variable variance accounted for by 
the independent variance was much the same in both cases. For these 
reasons, it seems reasonably safe to conclude that conclusions about 
a sex-race interaction and a non-linear age effect drawn from one 
sample probably apply to the other. 



Test 



Percent 



PPVT 

PSI 

BI 

VS MS 



62 

74 

9 

14 



PRC R-886 

100 . 



EXHIBIT 40 - MULTIPLE REGRESSION PARAMETERS FOR EACH TEST 



Test 


Independent 

Variable 


Estimate of 
P Coefficient 


CTp 


a 

(Cons lane 
Term^^ ' for 
Test Indicated) 


Unweighted 
Average 
of Center 
Means for 
Test Indicated 


PPVT 


Age 


0.5382 


0.0920 


10.96 


39.61 




Sex 


0.0178 


0.0324 








Race 


-0.0460 


0.0155 








Population 


-0.2711 


0.5676 








Poverty 


-0.0377 


0.0233 








Exposure 


0.0560 


0.0777 






PSI 


Age 


0.9584 


0.1090 


-3.55 


46.84 




Sex 


-0.0251 


0.0384 








Race 


-0.0324 


0.0183 








Population 


-0.8095 


0.6723 








Poverty 


-0.0281 


0.0276 








Exposure 


0.0762 


0.0921 






BI 


Age 


0.2850 


0.2025 


113.02 


142.86 




Sex 


0.0131 


0.0714 








Race 


-0.0145 


0.0340 








Population 


2.7182 


1.2496 








Poverty 


0.0228 


0.0512 








Exposure 


-0.1142 


0.1711 






VS MS 


Age 


0.3133 


0.1071 


36.69 


55.21 




Sex 


-0.0238 


0.0377 








Race 


0.0184 


0.0180 








Population 


0.0332 


0.6606 








Poverty 


-0.0288 


0.0271 








Exposure 


0.0466 


0.0905 







Note; (1) Y-intercept of regression line 
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EXHIBIT 41 - INDEPENDENT VARIABLES WITH SIGNIFICANT BETA 

COEFFICIENTS, BY TEST 



Dependent 

Variable 


Independent Variable 


Age 


Sex 


Race 


Exposure 


Population 


Poverty 


PPVT 


x(i) 




X 








PSI 


X 












BI 










X 




VSMS 


X 













Note: (1) An X indicates that the 95 percent confidence interval for 
the independent variable's p coefficient does not include 
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center average PPVT raw ecoree attribuUble to Age (A), Sex (S), and 
Race (R) combined. The exposure variable W , the non-linear age 
variable A^ , and finally a sex- race interaction component SR were 
then added. The percentages of total variance attributable to these 
variables were approximately: 



A, S, R 


54.6 


A, S, R, W 


56.6 


A, S, R, W, A^ 


57.9 


A, S, R, W.A^.SR 


58.2 



In effect, by this procedure, the addition of the exposure variable 
increased the variance accounted for by about 2 percent of the total 
variance. The non-linear age variable accounted for another 1.3 per- 
cent, and the sex- race interaction component for yet another 0.3 per- 
cent. In these terms, it appears that the exposure variable adds little 
to center average prediction, and that prediction for the PPVT is not 
substantially influenced by non-linear age effects or by sex-race 
Interactions. 

(3) Multivariate Multiple Regression Analysis 

With Total Estimated Exposure as an jSadepend- 
ent Variable 

The exposure variable in the first regression 
analysis was based on weeks of enrollment of a child at time of testing. 
The daily length of programs varied to some extent (from 2 to 7 hours). 
To examine the possibility that total hours of exposure waff a more 
significant measure than number of weeks regardless of hours per 
week, the exposure variable was converted and a By- Center multi- 
variate multiple regression analysis was performed. This analysis 
utilized the total sample of children, including those tested in Spanish. 
The analysis also used language of testing as an independent variable. 

In this analysis, the exposure variable was obtained by multi- 
plying the center average weeks for the total center sample by five 
times the number of daily hours of the center's program. No attempt 



o 
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w&t made to determine the actual houri of attendance of each child. 

Thus, the measure is an approximation of the actual amount of exposure. 

The results of this analysis were essentially the same as those of 
the primary regression analysis, described in subsection IV. B. 2. a. (2) 
above. As before, the 95 percent confidence limits for the p coeffi- 
cient of the exposure variable included zero. Thus, no support was 
found for the hypothesis that total amount of exposure was any more 
significant a predictor than program length. 

b. By- Child Analyses 

(1) Analyses of Covariance With Eligible English- 
Tested Children in Subpopulations Defined by 
Age, Sex, and^Race 

Four-variate analyses of covariance, similar 
to the analyses described in subsections IV. B.2.a. (1) and IV. B. 2.a. (2), 
were made with 10 subgroups of children defined by age, sex, and race. 
For these analyses, the observations were individual scores, rather 
than center averages. In performance of these analyses, four co- 
variates were used: 

• Family income 

• Family size 

• Population- community size (again expressed as the log of 

the population of the child's town) 

• Daily program (number of hours per day of the child's CDC 
program) 

Since data for the two family covariates came from the Family 
Information Form collected by the Bureau of the Census by mail, there 
were losses of children from the subsamples owing to missing data. 

In some cases, forms were not returned by parents. In some cases, 
incomplete forms were returned. Since, for the By-Child analyses, 
a child was dropped from the analysis If there were any missing data, 
the resultant sample sizes were substantially reduced. No analysis 
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of the type! of children for whom family data were mining hai been 
made. The 10 gronpa analyzed, with the number of children in each 
group, are lifted in Exhibit 42. 

For convenience of reference, data on the independent variable! 
and results for each group have been assembled in a single exhibit. 
Thus, Exhibit 43 shows, by group and treatment level within each 
group, the average for each covariate, and the unweighted and adjusted 
test means. 

A group is starred if the Wilk*s Lambda was significant at the 
0.05 level. None of the F- ratios for the individual dependent variables 
was significant at the 5 percent level. 

Finally, the total and component covariate increments for each 
group, level, and test are listed in Exhibit 44. 

(2) Multiple Stepwise Regression Analyses 

Multiple stepwise regression analyses were 
made with a number of dependent and independent variables. These 
analyses, unlike the By-Child covariance analyses, were univariate. 
The dependent variables analyzed were: 

• PPVT Raw Score 

• PSI Total Raw Score 

• BI Total Raw Score 

• VSMS Raw Score 

• PSI Subtest 1 (Personal-Social Responsiveness) 

• PSI Subtest 2 (Associative Vocabulary) 

• PSI Subtest 3 (Numerical Concept Activation) 

• PSI Subtest 4 (Sensory Concept Activation) 

• PPVT IQ 

• VSMS Social Quotient 

It is convenient to think of the independent variables in terms of 
groups or subsamples of children as well as of names or labels* The 
groups, identified by name and number, are listed below with the 
independent variables (and their identification number) used with each 



group. 
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EXHIBIT 44 - SUBGROUP COVARIANCE INCREMENTS 



GROUP 1: THREE- YEAR-OLD MALES, NON-WHITE 



Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


-1.8391 


-0.2882 


4.3651 


-3.1530 


-0.85 


PPVT 


M 


-1.3283 


-0.9568 


-0.4774 


0.5655 


-2.16 




L 


1.9924 


0.7032 


-3.0351 


2.0563 


1.75 




s 


-4.1135 


-0.3272 


3.0889 


7.9930 


6.65 


PSI 


M 


-2.9709 


-1.0863 


-0.3379 


-1.4335 


-5.78 




L 


4.4563 


0.7984 


-2.1478 


-5.2128 


-2.10 




S 


-2.7713 


-1.7718 


8.0963 


0.7188 


4.28 


BI 


M 


-2.0015 


-5.8824 


-0.8855 


-0.1289 


-8.87 




L 


3.0023 


4.3232 


-5.6295 


-0.4688 


1.23 




S 


-0.5927 


-0.0986 


2.1199 


5.6239 


7.03 


VSMS 


M 


-0.4281 


-0.3273 


-0.2319 


-1.0086 


-2.00 




L 


0.6421 


0.2406 


-1.4740 


-3.6678 


-4.28 


C 


3ROUP 2. THREE-YEAR-OLD FEMALES, NO 


N-WHITE 




1 








Family 






Test 


Duration 


Population 


Income 


Size 


Hours 


Total 




S 


-0.4810 


-2.0616 - 


22.0441 


-0.1757 


-24.78 


PPVT 


M 


-8.4650 


0.7542 


6.6132 


-0.0711 


-1.13 




L 


6.9259 


0.2011 


3.5271 


0.1276 


10.82 




S 


-0.2003 


0.0134 


-2.4304 


0.1012 


-2.53 


PSI 


M 


-3.5261 


-0.0049 


0.7291 


0.0410 


-2.75 




L 


2.8850 


-0.0013 


0.3889 


-0.0735 


3.21 




S 


-0.1256 


-1.6859 


11.1527 


-0.0758 


9.26 


BI 


M 


-2.2097 


0.6168 


-3.3458 


-0.0307 


-4.95 




L 


1.8080 


0.1645 


-1.7844 


0.0551 


0.26 




S 


-0.1345 


0.8533 


5.8591 


0.1501 


6.72 


VSMS 


M 


-2.3677 


-0.3122 ■ 


-1.7577 


0.0608 


-4.37 




L 


1.9372 


-0.0833 ■ 


■0.9375 


-0.1090 


0.81 
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EXHIBIT 44 (Continued) 



GROUPS: FOUR -YEAR -OLD MALES, WHITE 



Test 


Duration 


Population 


Inccnne 


F amily 
Size 


Hours 


Total 




S 


0.0511 


0.3298 


-1.4574 


-0.2525 


-1.33 


PPVT 


M 


0.1277 


-0.0256 


0.7114 


-0.2666 


0.56 




L 


-0.8002 


-0.5189 


-1.0237 


1.9361 


-0.40 




S 


-0.1502 


-0.0110 


-0.8624 


-0.4617 


-1.47 


PSI 


M 


-0.3756 


0.0009 


0.4209 


-0.4874 


-0.43 




L 


2.3536 


0.0173 


-0.6057 


3.5398 


5.31 




S 


0.1509 


0.8466 


-0.0581 


-1.1141 


-0.18 


BI 


M 


0.3773 


-0.0656 


0.0284 


-1.1760 


-0.81 




L 


-2.3645 


-1.3322 


-0.0408 


8.5416 


4.82 




S 


-0.0491 


1.3848 


-0.0462 


-0.1628 


1.13 


VSMS 


M 


-0.1227 


-0.1074 


0.0225 


-0.1718 


0.38 




L 


0.7686 


-2.1791 


-0.0324 


1.2481 


0.18 



GROUP 4: FOUR -YEAR -OLD MALES, NON-WHITE 



Test 


Duration 


Population 


Income 


F amily 
Size 


Hours 


Total 




S 


-0.37"'4 


-0.0683 


0.4531 


0.4794 


0.47 


PPVT 


M 


-0.956i 


0.3732 


0.1708 


-0.4330 


-0.85 




L 


0.6070 


-0.0319 


0.4605 


-0.3248 


-0.21 




S 


0.0069 


-0.0039 


0.1723 


0.0075 


0.18 


PSI 


M 


0.0173 


0.0211 


0.0650 


-o:oo67 


0.10 




L 


-0.0110 


-0.0018 


-0.1751 


-0.0050 


- 0.19 




S 


0.1537 


-0.0268 


0.4943 


2.2978 


2.89 


BI 


M 


0.3873 


0.1467 


0.1864 


-2.0754 


-1.37 




L 


-0.2459 


-0.0125 


-0.5024 


-1.5565 


-2.34 




S 


0.4927 


-0.0715 


0.1388 


-0.0813 


0.50 


VSMS 


M 


1.2416 


0.3907 


0.0523 


0.0735 


1.76 




L 


-0.7883 


-0.0334 


-0.1411 


0.0551 


-0.89 
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GROUPS: FOUR-YEAR-OLD FEMALES, WHITE 



Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


0.5650 


0.1495 


-2.1473 


0.0000 


-1.40 


PPVT 


M 


-0.5336 


-0.1522 


2.0936 


-0.2871 


1.13 




L 


1.2869 


0.4676 


-5.9587 


2.4043 


-1.76 




S 


0.3868 


0.1869 


-0.6879 


0.0000 


-0.08 


PSI 


M 


-0.3653 


-0.1903 


0.6707 


-0.3251 


-0.20 




L 


0.8811 


0.5845 


-1.9088 


2.7223 


2.31 




S 


-0.7302 


2.4734 


1.6486 


0.0000 


3.38 


BI 


M 


0.6897 


-2.5186 


-1.6074 


0.0781 


-3.36 




L 


1.6633 


7.7356 


4.5749 


-0.6544 


9.95 




S 


-0.1518 


0.1173 


-0.0756 


0.0000 


-0.11 


VSMS 


M 


0.1433 


-0.1195 


0.0737 


-0.0346 


0.07 




L 


-0.3457 


0.3669 


-0.2098 


0.2896 


0.09 


GROUP 6: 


FOUR-YEAR- 


OLD FEMALES, NON-WHITE 




Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


0.1046 


-0.3898 


-0.3053 


-0.1153 


-0.71 


PPVT 


M 


0.5150 


0.9208 


-0.1654 


0.4718 


1.74 




L 


-0.3058 


0.1680 


0.4453 


-0.0105 


0.30 




S 


-0.2290 


-0.4061 


-0.0196 


-0.0930 


-0.74 


PSI 


M 


-1.1274 


0.9593 


-0.0106 


0.3806 


0.22 




L 


0.6694 


0.1751 


0.0286 


-0.0085 


0.87 




S 


0.1748 


-0.6564 


0.2761 


-0.3433 


-0.56 


BI 


M 


0.8603 


1.5504 


0.1496 


1 .4045 


3.95 




L 


-0.5108 


0.2829 


-0.4026 


-0.0312 


-0.66 




S 


0.0154 


-0.4977 


-0.0214 


0.0213 


-0.48 


VSMS 


M 


0.0757 


1.1756 


-0.0116 


-0.0873 


2.84 




L 


-0.0449 


0.2145 


0.0312 


0.0019 


0.21 
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EXHIBIT 44 (Continued) 



GROUP 7: FIVE-YEAR-OLD MALES, WHITE 



Test 


Duration 


Population 


Income 


Family 

Si7.e 


Hours 


Total 




S 


0.0357 


0.1724 


-0.3312 


-0.1643 


-0.28 


PPVT 


M 


0.0031 


0.3135 


-0.0936 


0.0758 


0.30 




L 


-0.0689 


-2.1006 


1.0295 


-0.1980 


-1.33 




S 


-1.3850 


-0.0011 


-0.2173 


-0.1085 


-1.70 


PSI 


M 


-0.1194 


-0.0019 


-0.0614 


0.0501 


-0.12 




L 


2.6746 


0.0129 


0.6755 


-0.1308 


3.26 




S 


1.2533 


0.3452 


-0.7496 


-0.9925 


-0.14 


BI 


M 


0.1080 


0.6277 


-0.2118 


0.4581 


0.98 




L 


-2.4202 


-4.2057 


2.3302 


-1.1961 


-5.49 




S 


0.0759 


-0.0237 


0.0843 


0.5479 


0.68 


VSMS 


M 


0.0065 


-0.0431 


0.0238 


-0.2529 


-0.27 




L 


-0.1465 


0.2884 


-0.2621 


0.6603 


0.54 



GROUP 8: FIVE-YEAR-OLD MALES, NON-WHITE 



Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


-0.5460 


0.0217 


-0.3218 


-0.3570 


-1 .22 


PPVT 


M 


-2.0178 


0.3407 


1.8391 


0.7288 


0.88 




L 


1.5905 


-0.1649 


-0.2644 


0.2082 


1.36 




S 


-0.3229 


0.0459 


-0.0923 


0.2055 


-0.17 


PSI 


M 


-1.1933 


0.7200 


0.5273 


-0.4195 


-0.36 




L 


0.9406 


-0.3485 


-0.0758 


-0.1199 


0.40 




S 


-1.4692 


0.0644 


0.3502 


-0.4898 


-1.57 


BI 


M 


-5.4297 


1.0109 


-2.0009 


1.0000 


-5.43 




L 


4.2799 


-0.4894 


0.2876 


0.2857 


4.37 




S 


0.08147 


0.0227 


0.0878 


-0.3504 


-0.16 


VSMS 


M 


0.3011 


0.3563 


-0.5014 


0.7154 


0.70 




L 


-0.2373 


-0.1725 


0.0721 


0.2044 


-0.05 
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GROUP 9: FIVE-YEAR-OLD FEMALES, WHITE 



Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


0.1400 


-0.1095 


0.0049 


-0.0679 


-0.03 


PPVT 


M 


-0.1600 


-0.1264 


-0.0027 


0.0124 


-0.28 




L 


0.7098 


1.4998 


0.0000 


0.2284 


2.44 




S 


0.1836 


-0.1210 


0.1303 


-0.1381 


0.06 


PSI 


M 


-0.2098 


-0.1397 


-0.0708 


0.0251 


-0.40 




L 


0.9309 


1.6573 


-0.0009 


0.4647 


3.05 




S 


-0.3385 


-0.1427 


-0.6641 


0.0993 


-1.03 


BI 


M 


0.3868 


-0.1647 


0.3609 


-0.0181 


0.57 




L 


-1.7165 


1.9539 


0.0048 


-0.3341 


-0.09 




S 


0.1986 


-0.0196 


0.3963 


-0.0508 


0.52 


VSMS 


M 


-0.2270 


-0.0227 


-0.2154 


0.0092 


-0.46 




L 


1.0072 


0.2689 


-0.0029 


0.1708 


1.45 




GROUP 10: 


FIVE-YEAR-OLD FEMALES, NON-WHITE 


Test 


Duration 


Population 


Income 


Family 

Size 


Hours 


Total 




S 


-1.6516 


0.1102 


0.7430 


0.0521 


-0.77 


PPVT 


M 


-1.8404 


-0.3098 


-1.1491 


-0.1910 


-3.52 




L 


2.1235 


-0.0138 


-0.3197 


0.0124 


1.77 




S 


-1.4040 


0.2162 


1.1196 


-0.1054 


-0.18 


PSI 


M 


-1.5645 


-0.6080 


-1.7315 


0.3865 


-3.55 




L 


1.8052 


-0.0270 


-0.4817 


-0.0251 


] .24 




S 


-0.3595 


0.2777 


1.0182 


-0.4029 


0.55 


BI 


M 


-0.4005 


-0.7810 


-1.5746 


1.4773 


-1.28 




L 


0.4622 


-0.0347 


-0.4381 


-0.0959 


-0.11 




S 


-0.5836 


0.0128 


0.0832 


-0.2516 


-0.74 


VSMS 


M 


-0.6503 


-0.0359 


-0.1287 


0.9225 


0.10 




L 


0.7504 


-0.0016 


-0.0358 


-0.0599 


0.35 
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• Group 1 utilized all eligible children tested in English 
(N =517); 

• Group 4 utilized all non-eligible children tested in English 
(N = 87); and 

• Group 5 utilized children tested in Spanish (N = 42), 
for these independent variables: 

0 - Program Lrength in Weeks (Weeks) 

1 - Age 

2 - Mother's Education (M. Ed.)^ 

3 - Race 

4 - Sex 

5 - Daily Program Length in Hours (Hours) 

6 - Population (Pop. ) 

7 - Family Size (F. Size) 

8 - Center Size--Number of Classes in CDC--(C. Size) 

9 - Family Income (Inc. ) 

• Group 2, obtained from Group 1, (N = 387) utilized variables 
0 through 9, plus: 

10 - Father's Education (F.Ed. )^ 

• Group 3, obtained from Group 2, (N = 280), utilized 
variables 0 through 10, plus: 

11 - Child's CDC's teachers' average amount of 

experience with children from conditions of 
poverty (P-Exp. )^ 

12 - Child's CDC's teachers' average amount of 

experience with pre-schoolers (PS-Exp. )2 



The variables for mother's education and father's education were 
simply numerical values for the scale provided on the Family Informa- 
tion Form: 

1 = No school 5 = 9 to 11 years 

2 = 1 to 3 years 6 = High school graduate 

3 = 4 to 6 years 7 = Some college or college graduate. 

4 = 7 to 8 years 



2 

The variables for teachers' experience were simpl.y numerical values 
given to the scales on the Staff Member Information Form: 

1 = None 

2 = 1 to 3 years 

3 = 4 to 5 years 

4 = Over 5 years 
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Not all dependent variables were analyzed for all groups. High- 
lights of the results of the analyses performed are given in Exhibit 45, 
Since Exhibit 45 summarizes a large amount of information quite com- 
pactly, a detailed explanation is warranted at this point. 

Each row presents the outcomes of a given analysis. The first 
column on the left contains the name of the dependent variable used in 
the analysis (e, g, , PPVT RS denotes PPVT Raw Score), The next 
column gives the group identification number and is followed by a col- 
umn giving the group sample size. For the convenience of the reader, 
the average age (in months) of the group is presented next. The next 
two columns give the mean and standard deviations of the dependent 
variable for the group indicated. In the column following, the per- 
centage of total score variance attributable to Variable 0 (program 
length in weeks) is given, "Weeks” was always forced into the analysis; 
consequently it always entered first. The next column (Total Percent 

of Variance Accounted for by All Independent Variables Used) gives, 

2 

for the indicated analysis, 100 x R from the final step. For example, 
for Group 1, all 10 independent variables together accounted abo'xt 32 
percent of the PPVT Raw Score variance. 

There follows a block of three columns containing t-values for 
the (3 coefficient for the "weeks” variable (Variable 0). The first 
t-value is that obtained initially, when the variable was forced into 
the analysis; the second value is the maximum in the analysis; the 
third is the final value at the end of the analysis. The sign, of course, 
corresponds to the sign of the p weight. For all practical purposes, 
in this exhibit an absolute value of 2,0 or more is associated with a 
probability of 0.95 or more that the coefficient is different from zero. 

Exhibit 45 next lists the independent variables by number in the 
order in which they entered the analysis. It should be noted that the 
order starts with 2, Number 1 was omitted since, as noted above, it 
was always Variable 0, or Weeks, In a given analysis, the order in 
which each variable, other than Variable 0, entered the analysis 
depended in effect on which varialle had the beta coefficient with the 
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HIGHLIGHTS OF RESULTS OF 



STEPWISE REGRESSION ANALYSIS BY T 



[>epende nt 
Variable 


Group 

Number 


Group 

N 


Average 

Age 


OepenOeet 

Variable 

lAean 


DepenOeei 

Variable 

BtaaOard 

Oeviatien 


Initial 
•I Telal 
Variance 

fer Week! 


Total ^ 
uf Variance 
fur All 

InOe pendent 
Variable! U!ed 


T V.lM Ur t 
CMfiici.M Ur W..lw 


Oi 

of InOep 


rOer 

enOei 


1 H> 
1 


mwy 

iriaO 




u 


Starting 


Maximum 


Final 


2 


3 


4 


5 


4 


7 


0 


5 


10 


11 




1 


417 


40.lt 


40.03 


11.20 


0.10 


31.73 


•0.7075 


0.7540 


0.7540 


0 


1 


21 


22 


17 


23 


2 


15 


10 


. 




2 


3B7 


40.71 


41.35 


11.45 


0.25 


34.45 


•1.0bi7 


•1.0410 


0.2070 


0 


1 


20 


21 


17 


15 


23 


2 


10 


22 


PPVT R» 


3 


2t0 


40.45 


41.00 


11.32 


0.24 


31.00 


•0.0550 


•1.3017 


1.3574 


0 


20 


1 


21 


15 


14 


17 


2 


15 


10 




4 


S7 


57.72 


30.40 


11.47 


3.10 


54.51 


-1.4720 


•1.4720 


0.4734 


15 


0 


22 


23 


21 


1 


17 


10 


2 


- 




5 


42 


72.47 


41.t3 


5.44 


14.77 


55.15 


• 2.0350 


•2.0350 


•2.2413 


0 


2 


15 


17 


23 


22 


10 


eU> 


• 


- 




1 


517 


40.lt 


47.03 


14.52 


0.23 


41.40 


•1.0502 


1.3500 


1.3572 


0 


21 


1 


15 


2 


10 


22 


17 


23 


• 




2 


3S7 


40.71 


4t.3t 


15.13 


0.73 


44,74 


•1.4030 


•1.4030 


0.5744 


0 


21 


15 


20 


1 


17 


10 


2 


23 


22 


FBI TeUl 


3 


2B0 


40.45 


40.42 


15.52 


2.10 


45.15 


-2.4054 


•2.4054 


0.0054 


0 


21 


15 


20 


1 


17 


10 


14 


00 


15 




4 


•7 


57.72 


44.04 


14.45 


1.43 


45.50 


0.4503 


1.1543 


1.0547 


15 


0 


21 


23 


2 


t 


22 


17 


10 


- 




5 


42 


72.47 


50.55 


14.10 


20.23 


50.22 


-3.1053 


•3.1053 


•2.2215 


0 


2 


15 


17 


23 


22 


10 


o 


- 


- 




1 


517 


40.lt 


145.41 


23.42 


0.10 


4.70 


0.0122 


0.5040 


0.4500 


21 


0 


2 


15 


22 


1 


17 


23 


10 


• 




2 


3B7 


40.71 


143.23 


23.45 


0.05 


5.04 


-0.4553 


•0.4707 


•U.4544 


21 


0 


17 


2 


15 


1 


20 


23 


02 


e 


B1 Tecal 


3 


2M 


40.45 


142.03 


23.03 


0.05 


0.20 


• 0.31H 


•0.4771 


•0.4050 


17 


20 


0 


15 


1 


23 


21 


2 


14 


22 




4 


•7 


57.72 


143.41 


22.55 


- 


• 


- 


. 


• 
























S 


42 


72.47 


151.05 


21.15 


- 


- 


• 


• 


- 
























1 


517 


4B.lt 


55.31 


0.41 


<0.02 


10.50 


0.0202 


1.3311 


I.OIM 


0 


21 


At 


17 


2 


23 


15 


1 


22 


• 




2 


M7 


4B.71 


55.50 


0.33 


<0.02 


10.53 


-0.0453 


1.0504 


0.0051 


0 


10 


23 


2 


1 


17 


21 


15 


02 


e 


VS Mi 


5 


2BB 


40.40 


55.50 


0.40 


0.01 


12.00 


0.2544 


1.7072 


1.7072 


0 


it 


23 


15 


2 


21 


17 


15 


1 


00 




4 


•7 


57.72 


53.50 


13.04 


- 


- 


- 


. 


• 
























5 


42 


72.47 


55.17 


10.41 


- 


- 


- 


• 


• 
























1 


517 


4B.lt 


14.05 


4.50 


0.02 


30.04 


-0.5452 


1.3733 


i..;33 


0 


21 


2 


1 


22 


15 


17 


10 


23 


• 




2 


3B7 


40.71 


14.07 


4.47 


0.24 


32.17 


-0.5755 


0.7470 


0.7143 


0 


21 


20 


2 


15 


17 


1 


23 


10 


e 


HI 1 


3 


2M 


40.45 


14.54 


4.57 


- 


- 


- 




























4 


•7 


57.72 


15.00 


4.32 


- 


- 


- 


• 


• 
























S 


42 


72.47 


14.21 


4.45 


- 


- 


- 


- 


• 
























1 


517 


40.lt 


5.57 


4.07 


0.20 


30.50 


-1.0105 


• I.OIH 


0.2400 


0 


21 


1 


22 


2 


15 


10 


2^ 


17 


• 




2 


3B7 


40.71 


10.10 


4.55 


0.55 


54.47 


-1.4404 


-1.4404 


•0.02#4 


0 


20 


1 


21 


2 


15 


22 


17 


10 


23 


FSI 2 


3 


2B0 


40.45 


10.33 


5.14 


- 


- 


- 


- 


- 


. 


- 


- 


- 


. 


• 


• 


• 


• 


. 




4 


•7 


57.72 


5.01 


4.70 


• 


- 


. 


• 


. 
























5 


42 


72.47 


5.50 


4.35 


- 


- 


- 


- 


- 
























1 


517 


40.lt 


0.43 


3.70 


0.50 


30.52 


-1.4151 


1 .4253 


1.4253 


0 


15 


21 


1 


23 


17 


22 


e 


- 


• 




2 


3B7 


40.71 


0.43 


3.77 


0.00 


43.10 


-1.0451 


-1.0451 


0.4302 


0 


15 


21 


1 


17 


22 


23 


20 


10 


2 


FBI 3 


3 


2B0 


40.45 


0.51 


5.05 


- 


- 


. 


- 


- 
























4 


•7 


57.72 


0.37 


3.50 


- 


- 


- 


- 


- 
























5 


42 


72.47 


10.02 


5.70 


; 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 




1 


517 


40.lt 


12.50 


5.07 


0.24 


34.35 


-1.1721 


1.7121 


1.4775 


0 


15 


21 


2 


1 


10 


17 


23 


22 


• 




2 


3B7 


40.71 


12.75 


3.52 


0.75 


30.00 


-1.7524 


-1.7524 


0.0771 


0 


15 


21 


2 


17 


20 


1 


10 


22 


23 


PSI 4 


3 


2B0 


40.45 


12.03 


5.00 


. 


- 


- 


- 


- 
























4 


S7 


57.72 


11.40 


3.52 


- 


• 


- 


- 


- 
























5 


42 


72.47 


14.01 


3.44 


- 


- 


- 


- 


- 
























1 


517 


40.lt 


04.77 


10.01 


0.10 


10.41 


0.7442 


1.2325 


1.2329 


1 


21 


22 


23 


2 


17 


15 


0 


10 


• 




2 


3t7 


40.71 


05.10 


15.15 


<0.02 


13.45 


0.1755 


0.0700 


0.0700 


1 


20 


21 


17 


23 


15 


2 


22 


10 


0 


PPVT IQ 


3 


2t0 


40.45 


04.13 


15.44 


- 


. 


. 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 




4 


S7 


57.72 


05.04 


24.07 


- 


- 


- 


- 


- 
























5 


42 


72.47 


71.74 


14.51 


- 


- 


- 


- 


- 
























1 


517 


40.lt 


100.02 


27.40 


0.04 


7.12 


0.4541 


0.0904 


0.0504 


21 


2 


0 


17 


15 


1 


23 


22 


u 


- 




2 


3t7 


40.71 


100.33 


27.52 


<0.02 


9.05 


0.1154 


o.oao 


0.0400 


21 


17 


2 


22 


0 


15 


1 


73 


10 


20 


VSMS SQ 


3 


2t0 
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27.77 
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S7 


57.72 
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5 


42 


72.47 
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Not«i: (1) An ait«riik d«not«i analyiii Urminat«d b«for« all variabUi w«r« «nt«r«d (■•• text). 
(2) Variable namei and numberi are ai follow! : 



Number Name 



1 


Race 


2 


Sex 


6 


Week! 


8 

15 




16 


P-Exp. 


17 


Hour! 


18 


Center Sixe 


19 


Population 


20 


Father'! Education 


21 


Mother*! Education 


22 


Family Size 


23 


Income 



(5) Variable! Income, P-Exp, , and PS-Exp. are omiltet! here for convenience. The n- were no !ienificant beta! for Income or P-Exp., and only one negative one for PB-Exp. for 
VSMS in Group L 
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largest t-value^ when each variable was entered by itself into a regres- 
sion equation with the variables from the preceding step. For example, 
in the analysis of the PPVT RS (raw score) with Group 1, Variable 1 
(Age) was the first variable to enter after Variable 0 because, of the 
nine variables considered, it had the largest beta coefficient t-value 
(and therefore F-ratio, since = F) in a E-variable regression equation 
employing weeks. Variable 3 (Race) was selected next because, of the 
remaining eight variables, it had the largest beta weight, t-value in a 
3-variable regression equation employing weeks and age. An asterisk 
after the last variable entered means that some remaining variable or 
variables never entered the analysis at all since the criterion values for 
selection were too low. 

Finally, for each analysis, an indication is provided of those 
variables whose final beta coefficient had a t-value corresponding to a 
probability of 0.05 or less. In addition, if the beta coefficient was 

negative, this is indicated with a minus sign. 

For ease of reference, the mean and standard deviation of each 
independent variable for each group is presented in Exhibit 46. 

c. Intertest Relationships 

One way of addressing the question of whether the 
tests that were used provide different information about Head Start 
children is to examine the intertest correlations. Tests that are highly 
intercorrelated do not add unique information about the characteristics 

or performance of the children. 

From the multivariate By- Center covariance analysis, we ob- 
tained the intertest product -moment correlation coefficients shown on 
the lower left half of the matrix in Exhibit 47. Product-moment coeffi- 
cients from the multiple regression analysis are shown in the upper 
right half of the exhibit. Thus, for example, based on center average 
scores, the PPVT raw scores correlated 0.86 with the PSI total raw 
scores in the covariance analysis. These intercorrelations are of 



^Actually, the selection criterion was an F-value; however, F = t . 
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EXHIBIT 46 - MEANS AND STANDARD DEVIATIONS OF INDEPENDENT 

VARIABLES BY GROUP 









Group 


Group 


Group 


Group 


Group 


variable 




One 


Two 


Three 


Four 


Five 


1. 


Age in 


X 


60.18 


60.71 


60.65 


57.72 


72.67 




Months 


a 


8.71 


8.67 


8.27 


7.01 


7.94 


2. 


Sex (Pro- 


X 


0.49 


0.49 


0.50 


0.46 


0.45 




portion of 


CT 


0.50 


0.50 


0.50 


0.50 


0.50 




Males) 












3. 


Race (Pro- 


X 


0.58 


0.53 


0.48 


0.62 


0.00 




portion of 
Non- Whites) 


CT 


0.52 


0.53 


0.54 


0.49 


0.00 


4. 


Weeks 


X 


19.90 


19.86 


21.20 


17.66 


12.79 






cr 


7.45 


7.15 


6.99 


5.21 


7.12 


5. 


Hours 


X 


3.79 


3.79 


3.90 


4.25 


3.31 




Per Day 


CT 


1.32 


1.28 


1.41 


1.48 


1.14 


6. 


Center Size 


X 


2.61 


2.60 


2.80 


2.83 


4.81 




(Number of 
Classes) 


CT 


1.43 


1.51 


1.56 


1.50 


2.89 


7. 


Population 


X 


5.14 


5.01 


5.16 


5.44 


5.14 




(Log of 
Town Size) 


a 


1.22 


1.27 


1.18 


1.31 


0.76 


8. 


Mother's 


X 


5.21 


5.25 


5.28 


5.02 


3.36 




Education 


CT 


1.11 


1.10 


1.06 


0.99 


1.56 


9. 


Father's 


X 


•n 


5,02 


5.05 


•H 






Education 


CT 


- 


1.31 


1.30 


- 


- 


10. 


Family 


X 


6.69 


7.11 


6.96 


6.23 


7.02 




Size 


a 


2.25 


2.51 


2.47 


2.42 


1.69 
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course contaminated or inflated by the intercorrelation of other vari- 
ables associated with center average scores. 



EXHIBIT 47 - 



INTERTEST PRODUCT- MOMENT CORRELATION CO- 
EFFICIENTS FROM BY- CENTER MULTIVARIATE 

COVARIANCE ANALYSIS AND MULTIPLE REGRESSION 
ANALYSIS 



PPVT 

PSI 

BI 

VSMS 


PPVT 


PSI 


BI 


VSMS 


0.86 

0.23 

0.36 


7^ 

o o o 

_ 


0.14 
0.0b ® 


0.29 
0.33 
, 0.15 



In Exhibit 48 are shown the least squares estimates of the inter - 
test covariances based on center averages (with the effects of age, sex, 
race, etc. , removed) which were used along with the residual variances 
to estimate the residual correlation coefficients; these are given in the 
upper right half of the matrix on the basis of the regression model and 
in the lower left half on the basis of the analysis of covariance model. 

EXHIBIT 48 - INTERTEST RESIDUAL CORRELATIONS 



PPVT 

PSI 

BI 

VSMS 


PPVT 


PSI 


BI 


VSMS 


0.70 

0.18 

0.14 


^ 0.7 5 

0.12 

0.18 


0.2B 
He 0-26 

0.03 1 


0.17 

0.18 
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Thus, only the PPVT and PSI have much common variance. In 
effect, they share around 50 to 55 percent of their variance. The other 
two instruments, which are based on information about children pro- 
vided by teachers or aides, have relatively little variance in common 
with each other or with the individual tests. 

The loss in intertest correlation that occurs when the effects of 
various independent variables are partialled out is readily understand- 
able when the partial correlation coefficients of the independent vari- 
ables with each of the tests are examined. From the By- Center multi- 
variate regression analysis, they are shown in Exhibit 49. 

EXHIBIT 49 - PARTIAL CORRELATION COEFFICIENTS OF THE 

INDEPENDENT VARLA.BLES WITH EACH OF THE 
FOUR TESTS 



Independent Variables 


Tests 


Age 


Sex 


Race 


Population 


Poverty 


Exposure 


PPVT 


0.59 


0,07 


-0.35 


-0.06 


-0.20 


-0.09 


PSI 


0.74 


-0.08 


-0.21 


-0.15 


-0.13 


-0.10 


BI 


0.17 


0.02 


-0.05 


0.26 


0.06 


-0.08 


VSMS 


0.34 


-0.08 


0.12 


0.01 


-0.13 


0.06 



The inter cor relation of the BI with the other tests was relatively 
unaifected by removing the effects of the independent variables, since 
it had little correlation with these variables. Removing the age variable 
from the intercorrelation of the VSMS with the PPVT and the PSI, on the 
other hand, reduced their intercorrelation markedly, since the VSMS 
has a sizeable correlation with age.^ On the other hand, the PPVT and 
PSI show a high intercorrelation even with the age variable removed, 
suggesting that they do indeed measure similar functions or processes, 
or at least that one is a fairly good predicator of the other. 

^The correlations of test scores with age are probably themselves some 
what inflated because of the range of ages included for analysis. 
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The Pre-School Inventory subtests were not examined in the By- 
Center analyses; that was done only in the By-Child analyses. To 
give an indication of the observed relationships of the PSI subtests to 
each other and to other instruments, residual correlation coefficients 
were calculated for Group 1 (N = 517). The correlations eliminate the 
inter correlations with the 10 independent variables. The residual cor- 
relation coefficient estimates are shown in the upper right half of the 
matrix in Exhibit 50; for comparison, the product-moment correlation 
coefficients for the same data are given in the bottom left half of the 
exhibit. 

EXHIBIT 50 - RESIDUAL CORRELATION COEFFICIENTS ESTIMATES 

AND PRODUCT-MOMENT CORRELATION COEFFI- 
CIENTS FOR PSI SUBTESTS AND OTHER TESTS FOR 
GROUP 1 (N = 517) 




These coefficients in Exhibit 50 are based on a different model 
and sample from the By- Center coefficients. They include in the coef- 
ficients whatever variations there are in intertest correlation from 
center to center. For convenience, the product-moment and partial 
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correlation coefficients of the 10 independent variables with each de- 
pendent variable for Group 1 are given in Exhibit 51. 

In terms of magnitude of relationships between the coefficients 
within and between models (By- Child and By- Center), the following 
statements are appropriate; 

• > By- Child coefficients of both kinds (product-moment and 

partial) are typically larger than the corresponding By- 
Center coefficients of either kind, 

• Product-moment correlations within models are typically 

S 

(in 80 percent of the cases) larger than the corresponding 
partial v-or relation coefficients, (Exceptions in all but 
one case involve the BI, ) 

Thus, the partial correlation coefficients of the By- Center model 
appear generally to give the most conservative or minimal estimate of 
intertest correlations. 



EXHIBIT 51 - CORRELATION OF INDEPENDENT AND DEPENDENT VARIABLES FOR GROUP 1 



PRC R-886 
123 



Income 


0.05 

0.05 


O O 

• • 

o o 


0.04 

0.02 


0.06 

0.04 


-0.01 

-0.01 


0.02 

0.02 


0.07 

0.07 


vO vO 

o o 
« • 

o o 


Family- 

Size 


0.01 

-0.07 


0.04 

-0.06 


-0.06 

-0.06 


0.02 

-0.01 


-0.01 

-0.07 


-0.03 

-0.12 


0.11 

0.03 


0.09 

0.02 


Mother' s 
Education 


0.04 

0.19 


I 0.03 

1 0.22 
1 


0.11 

0.12 


0.07 

0.12 


0.03 

0.17 


0.05 

0.19 


0.00 

0.17 


0.01 

0.18 


Population 


-0.30 

-0.04 


-0.38 
1 -0.13 


o o 

• • 

o o 


-0.06 

0.03 


-0.28 

-0.06 


-0.30 

-0.07 


-0.42 

-0.18 


-0.37 

-0.15 


Center 

Size 


-0.09 

-0.02 


-0.14 

-0.07 


-0.02 

-0.02 


-0.16 

-0.14 


-0.10 

-0.03 


-0.11 

-0.03 

... 


-0.14 

-0.07 


-0.16 

-0.10 


Hours 


-0.09 

-0.06 


o o 

• • 

o o 


l 

-0.03 

-0.03 


-0.06 

-0.05 


-0.08 

-0.04 


-0.03 

0.02 


-0.09 

-0.05 


-0.12 

-0.08 


Weeks 


1 

o o 

1 * * 

o o 


-0.05 

0.06 


o o 

• • 

o o 


0.00 

0.05 


-0.02 

0.06 


o o 

• • 

o o 


90*0 

90*0- 


-0.05 

0.07 


Race 


-0.34 

-0.24 


-0.30 

-0.14 


-0.02 

-0.04 


r-H 

o o 

• • 

o o 


-0.20 

-0.06 


-0.26 

-0.14 


-0.33 

-0.15 


-0.29 

-0.12 


Sex 


-0.01 

-0.05 


90*0 

80*0 


0.11 

0.10 


0.07 

0.05 


0.15 

0.14 


CO 00 

o o 

• • 

o o 


0.03 

0.00 


0.15 

0.15 


Age 


0.48 

0.41 


0.59 

0.51 


0.09 

0.12 


0.25 

0.23 


0.51 

0.45 


0.50 

0.44 


0.56 

0.44 


0.53 

0.43 




1 ^ (M 


PM 

PC 


S u 

di di 


PM 

PC 


PM 

PC 


PM 

PC 


PM 

PC 


PM 

PC 


Test 


PPVT 


PSI 


BI 


VS MS 


r-H 

HH 

di 


r4 

HH 

S 


CO 

HH 

S 


i— 1 

di 



o 

ERIC 



Notes: (1) PM denotes product-moment correlation. 

(2) PC denotes partial correlation. 
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V. DISCUSSION 



This section will discuss the results reported in the preceding 
section. There are three major areas of consideration: (1) the main 

hypothesis, (2) alternative hypotheses, and (3) comments on exceptions 
to the main results. 

A, The Main Hypothesis 

The principal result of this study may be stated as follows: there 

was no statistically reliable evidence of a treatment effect observed for 
the main samples of children tested, based on measures utilizing total 
test raw scores. Neither a systematic gain with time nor loss with time 
was found, nor was there evidence of a significant but non-linear 
relationship. 

This conclusion rests on four criterion statistics: 

• Wilks' Lambda for the covariance analyses, 

• The F- ratios for individual instruments in the covariance 
analyses, 

• The t- values (or confidence intervals) for the p coefficient 
for the exposure variable in the regression analyses, 

• Proportions of test variance attributable to exposure time. 

The covariance analyses provide the most powerful bases for 

statements about treatment effects. It is true that the By- Child analyses 
of covariance are of limited value, since the universe of these subgroups 
would be of indeterminate generality and of limited operational interest 
even if there had not been losses (sometimes severe) owing to missing 
family data. Furthermore, the discriminating power of many of the 
By- Child covariance analyses is rather low, owing to the small sample 
sizes. However, the primary By- Center covariance analyses, whether 
for the total eligible sample of English- tested children or for the total 
eligible sample, can be interpreted with more confidence. Neverthe- 
less, for main and subsamples, the result is negative. This lack of 
evidence of a relationship between program duration and performance 
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(or rating) of child applies whether the measures are considered jointly 
or (except in one case for the Behavior Inventory) individually. 

The regression analyses do not bear directly on the question of a 
treatment effect, since the treatment or exposure variable is not kept 
independent of other variables in these analyses. However, to the ex- 
tent that the p coefficient for the exposure variable is significantly 
different from zero, or has a confidence interval that does not include 
zero, there is presumption of an effect. Only one group--the eligible 
children tested in Spanish- -had an exposure variable (Weeks) with a 
significant p coefficient, and it was negative for the two measures 
analyzed (the PPVT and PSI). Furthermore, except for the group of 
Spanish- speaking children, the percentage of variance in the test scores 
accounted for by the exposure variable was almost never more than 2 
percent, and, in the By-Child regression analyses it was typically less 
than 1 percent on the basis of our procedure. 

As described in subsection IV. B, the attempt was made to re- 
place Weeks or program length with the total number of presumed hours 
in the program as a measure of exposure. In that case, too, the p 
weight for exposure was not significant. 

These considerations lead to two major questions that warrant 
further comment: 

1, Assuming that Head Start programs can produce measurable 
cognitive, social and emotional effects in children, what factors or 
hypotheses may account for the lack of evidence for them in the present 
study ? 

2. How can those few cases of significance that did occur be 
accounted for ? 

These two questions are discussed in that order below. 

B. Alternative Hypotheses 

What may have obscured our observation of treatment effects? 

The question can be addressed by consideration of a number of alter- 
native hypotheses. 
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It should be made clear from the outset that any discussion of ef- 
fects involves only consequences measurable by the tests employed. 

We are not concerned with medical or dental effects, for example, 
although these may well have been manifold. Let us also clarify what 
is meant by an effect, or at least the evidence for one. We cannot 
speak of gain scores, or changes in a child's score from the beginning 
to the end of the program. As the experiment was designed, the Short- 
term (S) group is the control or comparison group. The primary 
evidence for an effect is a difference between the means of S and either 
or both Medium-term (M) or Long-term (L) samples for the dependent 
variables considered together or individually. There are certainly 
other indicators, as noted above, although they may be less compelling, 
both in terms of power and operational significance. For example, 
there may be shifts in variability with no change in means. There may 
be consistent relationships between the means or medians of the samples 
(e, g, , 1;>M>S) even though the differences are not significant by a 
parametric test. There may be p weights in a multiple regression 
equation that are significantly different from zero, lending credence to 
the hypothesis of an effect associated with a treatment variable. How- 
ever measured, whenever a treatment level distinction (S, M, L) is re- 
tained, an effect in this study is basically a score change relative to 
the S level. We have avoided identifying effects with a particular psycho- 
logical construct or product. For the time being, an effect is defined 
purely operationally. 

With this background, let us consider possible obscuring factors, 

1. Non- Comparability of Samples on Essential Uncontrolled 

Variables 

The most compelling of these variables would be starting 
level of performance. There is no direct rejoinder to the hypothesis 
that L children, for example, had lower starting levels at the beginning 
of the program than the S children. (There is no evidence to support 
that hypothesis, either. ) Our covariance adjustments for age, sex, race, 
etc, , in effect made the three samples uniform with respect to those 
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variables. However, demonstrable equalization of entrance abilities 
or achievements could only have been accomplished by pre-testing or 
by randomization of assignment of children to treatments. Of these 
two methods, the latter would have been the more satisfactory and the 
less possible. (Actually, of course, neither method was possible.) 

No argument concerning the actual or statistical matching of 
control variables such as age, sex, number of siblings, mother's 
educational level, family income, and so on, can directly resolve the 
question of entry comparability of the children in this study. However, 
the logic of the argument for non- comparability is slippery. If, for 
example, there were a systematic incremental effect of the program, 
then our results suggest that there would have to have been a progres- 
sive decrease in entry level as a function of length of program. It is 
hard to imagine what would produce such a selection process. 

Selective attrition during programs could have produced the same 
result. An attempt was made to ascertain whether there was some 
correlation between the length of the program, defined by the average 
length of attendance in weeks of a center's sample of children, and the 
retention rate of the center. Retention rate is defined as the ratio of 
the number of children attending the program at the time of the testing 
who had been in the program at its start to the total number of children 
registered at the commencement of the center's program. 

There were great difficulties in obtaining the data necessary for 
these measures in many cases, owing to various administrative prac- 
tices of the programs. Although there appeared to be some positive 
relationship in the rank order correlation between these two variables, 

it was not significant ( r = 0.16; t = 1.33; df = 67; p >0.05). Thus, 

0 

available evidence does not generally support the hypothesis of a 
relationship between program duration and attrition. 

Another selection bias which could have had the same effect as 
experimental mortality was the absence of children, otherwise in 
attendance, during the week of testing. There was one case, for ex- 
ample, in which the center was more than decimated by an outburst of 
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miimpi and chickenpox while the tester was there. However, we have 
no practical way of addressing this selection hypothesis. 

In sum, we can neither accept nor reject this possible obscuring 
factor. It was hoped that the sampling plans used would minimize such 
a sampling bias; we cannot, however, demonstrate by data or design 
its non-existence. 

2. Inadequate Sample Sizes 

It is entirely possible that there were incremental differences 
between S, M, and L samples that did not show up owing to the limited 
number of centers and children. There is little doubt that sufficiently 
large samples will generally produce significant differences. However, 
the differences are apt to be so small as to be operationally meaningless. 

To provide an indication of the magnitude of differences we could 
have expected to find, the simultaneous 95 percent confidence limits for 
differences between adjusted means for the PPVT and the PSI were calcu- 
lated. These intervals, shown in Exhibit 52, are listed for the main By- 
Center group, and for the By- Child groups with the smallest (3 -year- old 
non-white female) and largest (4-year-old non-white male) total sam- 
ple sizes. 

From Exhibit 52, it can be seen that, for example, we could 
reasonably expect to detect L— S PPVT raw score differences of more 
than about 4 points (the half-interval length), or L-S PSI differences of 
about 4-2/3 points or more. The sensitivity of the observations de- 
creases drastically, of course, with a reduction in sample size, to the 
point (exemplified by the smallest By- Child group) at which differences 
that could have been detected with assurance are so big as to be 
meaningless. 

The intervals shown in Exhibit 52 were, as stated above, calcu- 
lated for all contrasts taken together. Thus, they are conservative. 
Confidence intervals for the difference between one pair of means taken 
individually (e.g., the L-S PPVT difference) can be obtained by dividing 
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the kO’ parameter by 




Thu«, for the L-S PPVT com- 



\ V / 

parison we should expect to detect differences larger than approximately 
3.1 points. Similarly, with the L-S PSI comparison, we should anticipate 
differences at the 0.05 level that are greater than about 3.7 points. The 
real issue is not basically whether the sample size was adequate; the 
issue is the meaning of adequate --i. e. , what magnitude of difference 
it is meaningful to try to detect. It is not possible to take a stand on 
this issue here. However, the ability to stipulate goals in this form 
surely chould be one of the major objectives of educational and psycho- 
logical research efforts. 



3. Non-Uniform Treatments and Effects Among or Within 
Samples 

Obviously, teachers, children, procedures, and facilities 
or environment varied from class to class as well as from CDC to CDC. 
Objectives and goals probably showed similar diversity. Consequently, 
whatever the specific nature of a Head Start center program, the "treat- 
ment" perforce varied at least from classroom to classroom. The re- 
sults of the treatment undoubtedly varied from child to child. The issue 
is why the net result of the infinite specific and different treatments 
would be zero. 

The question is complex, and clearly unanswerable without re- 
course to specifics about treatments and effects--a specificity far 
beyond the available data in this study. Indeed, the specification and 
measurement of treatments in child development and in education is 
still one of the most difficult problems in educational research.^ Par- 
ticularly in pre-school programs, the treatment variables are difficult 
to define except on a gross level and more difficult to measure reliably. 



See Gage, N. L. (ed. ), Handbook of Research on Teaching. Chicago: 
Rand McNally, 1963, passim. 
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There are undoubtedly interaction! of treatment! and !ubject!, 
however treatment ie defined. Whether we think of variable! a! teacher! 
or ae center program! (or both), different level! (individually or in 
combination) very likely interact with pupil variable!. Since we did not 
(indeed, could not) deeign our eample to provide obeervatione covering 
a known range of teacher-program- child variables, it could be argued 
that there were no apparent effects because there was an inadequate 
number of "positive” or "optimum" pupil-treatment observations.^ 
Perhaps if we had used classrooms as the basic sampling unit, and had 
tested all children in each sample classroom, we would have had suffi- 
cient numbers of specific teacher-program-pupil combinations at differ- 
ent levels to have seen an effect. The net effect would presumably re- 
sult from the occurrence of the differential effects. The theory and 
observation of a teacher (and/or program) - child interaction is con- 
side red fundamental not only in education, but in related enterprises 
such as psychotherapy,^ 

The problem here is not whether there are treatment- subject 
interactions, but what the net effect of treatments may be. For the 
design of this study, an effect, measured for instance by a difference, 

I 

A , in S and L means (i. e. , A= L - S ), can have one of three values: 

I 

A is positive. 

A is negative. 

A is, in effect, zero. 



T“ — 

Validity and sensitivity of the instruments are assumed for purposes 
of this argument. 

2 

See Gage (op. cit. ) passim. 

3 

See, for instance, Kiesler, Donald J. , "Some Myths of Psychotherapy 
Research and the Search for a Paradigm," Psychological Bulletin, 1966, 
Volume 65, pp. 110-136. 
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Let us assume, for purposes of discussion, that the experimental 
groups were comparable in starting level and that the two tests (PPVT 
and PSI) are each appropriate for measuring cognitive status or achieve- 
ment level of the children. The measures themselves depend upon the 
responses of the subject to some partially known stimulus. The responses 
are binary (from the experimenter's point of view, a response is either 
right or wrong), and there is the usual confounding of cognition and moti- 
vation, which (in highly oversimplified and non-operational terms) is 
something like this: 





Motivation 


Cognition 


Willing 
to Respond 


Unwilling 
to Respond 


Able to Respond 


Correct 


No Response or 




Response 


Wrong Response 


Unable to Respond 


Wrong Response 
or No Response 


No Response 



Here, the term "able" means "has the requisite knowledge and 
skills," while "willing’’ means "attempts to make best response possible 
according to perceived requirements (or rules) of the game (test)," 

As the relationships are depicted here, only the correct response has 
unambiguous meaning if we ignore the role of chance. In this study there 
were no specific criteria of motivation beyond the finding of the tester 
that the child was "testable" in the sense that he would stay in the situ- 
ation and respond at all. 

The effect of participation in a Head Start program could be in 
either or both realms and produce the same result. Conceptually, how- 
ever, the action of a treatment may differ according to the area or 
realm of effect. That is, it is reasonable to conceive of cognitive ef- 
fects as having a zero point and increasing in magnitude, complexity, 
scope, etc. It is more difficult to conceive of the cognitive effects as 
bi-polar, with changes (losses) occurring as a result of participation 
in a program. On the other hand, motivational effects could easily be 
positive or negative, and could interact with cognitive effects in a 
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variety of ways, uniformly or selectively (i. e., in terms of individuals 
or subgroups). However, regardless of what sort of treatment-pupil 
interaction is assumed, and regardless of what sorts of intereffect 
combinations are assumed, the absence of an observed treatment effect 
(i.e,, a difference in means) suggests that: 

• There was no measurable effect in the cognitive or cognitive- 
motivational realms,^ 

• Positive effects in the cognitive area were nullified by op- 
posing effects in the motivational area, 

• Positive effects in the motivational area were nullified by 
negative effects in the cognitive area. 

Another alternative, that positive effects in the motivational area 
were not accompanied by positive effects in the cognitive area, is 
tenable if, for example, it is assumed that S, M, and L samples were 
systematically less developed in the cognitive and that the positive 
motivational effect simply maximized the use of otherwise unaffected 
knowledge and skills. 

Any of the above hypotheses is possible, regardless of whether 
assumptions of uniformity or diversity of treatments and subjects are 
made. If one accepts the interaction point of view-- that the major 
effects depend on the interaction of subjects and treatments--PRC's 
results suggest either (1) that there were too few optimum combinations 
to make a measurable difference (a rareness or scarcity that does not 
bode well from the point of view of matching teacher selection, training, 
program structure and content, teacher behavior, or whatever manip- 
ulable variables are considered the effective dimensions of a treatment 
with the appropriate pupil variables), or (2) that there were as many 
and as strong negative combinations of treatments and subjects as 
there were positive ones. 



This hypothesis in no way rules out the possibility of major effects 
occurring with all or many children early in their participation in a 
program (e,g,, during the first 1 to 6 weeks). 



o 

ERIC 
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.^appro priate Instruments or Teste 

One major reason that effects which actually existed may not 
een noted centers around the measuring instruments. The instru- 
ments may not have been appropriate for a number of technical and con- 
ceptual reasons. 

The study was not designed to test hypotheses about the extent to 
whmn the tests were suitable or appropriate. However, the question of 
instrumentation is a major one in Head Start evaluation, as in other 
e ucational and social programs; it is thus appropriate to make what 
comments we can in this area. The tests will be discussed in terms of 
va 1 ity and reliability, not so much in the usual manner of providing 
coefficients of systematically collected correlational data, but by draw- 
ing on the data and inferences reasonably bearing on these properties. 

Validity 

From a program point of view, validity depends in 
part on objectives and goals. Head Start, like many compensatory pro- 
grams. is careful not to impose national goals which are too tightly 
defined. The following remarks, taken from a report submitted to the 

House Education and Labor Committee in 1966.^ illustrates the variety 
implicit in Head Start programs; 

THE NATURE OF A HEAD START PROGRAM 

®i^®® shapes. Some 

are three hours a day--others for a full day. The great 

children have, thus far, been en- 
rolled in SUMMER programs. This report is focused on 

expanded to FULL- 

YEAR programs. Some of these would operate nine months 
a year--others up to twelve. 



17 “ 

Bernard L. Boutin. Acting Director. Community 

Printed °f Economic Opportunity. 29 March 1966. 

"• .Supplemental Approp riation Bill. 1967 ; Hearings b efore 

C°”^”^^ttee on Ap p roWiations. House of Rep TjT 
sentatives , b9th Congress, pp. 141-15C ^ 
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Local needs and local decisions are the determining factors 
in developing and operating the program with one important 
exception. The Economic Opportunity Act requires that 
programs show promise of making a meaningful contribution 
to the elimination of poverty. To this end OEO insists that 
all programs must contain those elements of quality which 
are believed essential to really helping the child and his 
family. Programs devoid of content and purpose, programs 
aimed solely at permitting parents to work, and programs 
concerned exclusively with educational development cannot 
be funded under Head Start for none of these types shows 
sufficient promise of helping to end poverty. 

Head Start programs are carried out in Child Development 
Centers which: 

Provide help to both the child and his family. There 
must be the widest possible opportunities for parents 
to participate in program decisions and operations 
and to themselves be beneficiaries of the programs. 

Offer a comprehensive range of services which are 
critical to the child's development including medical 
and dental care, social services, nutritional support 
and a well designed program of daily activities de- 
signed to meet the needs of the individual child. 

Are a true community project involving cooperation 
among the professional and non-professional staffs, 
parents of the children, professional individuals and 
organizations serving children, government agencies, 
and ordinary citizens--young and old--who are able 
and willing to volunteer their time and skills. 

The particular methods and activities needed to accomplish 
these objectives cannot be specified by OEO. It can, and 
does, do everything in its power to advise communities of 
methods and activities which have shown promise in other 
communities. 

It is clear from this statement that the educational and develop- 
mental goals of individual programs are not fixed. There is no indicated 
requirement to improve communication skills, develop school readiness, 
improve social skills or responsiveness, or foster any specific behavioral 
change. Nevertheless, the programs have typically an educational- 
developmental tone or appearance. Consequently, measures applied to 
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program evaluation ought to include variables related to these areas 
of endeavor and/or behavior. The PSI was designed to have content 
validity with respect to achievement expected by teachers of children 
entering school. The VSMS ideally should have content validity for 
some aspects of personal- social development. The BI contains a num- 
ber of descriptive scales that appear to suggest the kind and direction 
of behavior which teachers probably like to encourage. The PPVT 
presumably taps a set of skills involved in verbal communications. 

From a program point of view, then, the tests at least seem to have 
relevance, or face validity. 

One criterion of validity is sensitivity of scores to those variables 
to which one would normally expect them to be sensitive. For example, 
the PPVT, PSI, and VSMS should all be sensitive to age, and they are. 
The correlation of these tests with age was higher than with any other 
variable (see Exhibits 49 and 51), and the beta weight for age was 
significant regardless of subgroup (see Exhibit 45). The individual 
test scores (PPVT and PSI) correlate with race, with whites tending 
to score higher than Negroes, although the coefficients are not as high 
as for age. All four measures appear to be sensitive to the level of 
education of the child's mother, as would be expected, and the PPVT 
and PSI are related also to the level of the father's education (see Ex- 
hibit 45). However, the BI and VSMS appear to be entirely unrelated 
to that latter variable. There is some indication that the tests (other 
than the VSMS) are slightly correlated negatively with family size, as 
might be expected.^ While the beta weight for poverty is not significant 
in the By- Center regression analysis (see Exhibit 41), the signs of the 
partial correlations of the PPVT, PSI, and VSMS are at least in the 
right or expected direction (see Exhibit 49), as are the signs of the 
coefficients for income and family size in the By- Child regression 
analysis (see Exhibit 51). There is finally some slight indication of 

^Anastasi, Anne and Rita Y. D'Angelo, "A Comparison of Negro and 
White Preschool Children in Language Development and Goodenough 
Draw-a-Man IQ, Journal of Genetic Psychology * Vol, 81, 1952, 
pp. 147-165. 
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■ enaitivity of teat acorea to aex, although it appeara aomewhat complex 
(aee, for example, Exhibit 51). Thua, at leaat the PPVT, PSI, and 
VSMS appear to be aomewhat aenaitive to variablea to which one would 
expect them to be aenaitive.^ 

The BI generally appeara to correlate little with any of the vari- 
ables atudied. There is indication of aome poaitive correlation with 
age. There is yet slimmer evidence of a correlation with sex, although 
boys tended to be rated slightly higher than girls. In the case of the 
By-Center multivariate regression analysis, the beta coefficient for 
population in the BI equation was significant and positive (Exhibits 40 
and 41). However, the variables to which the BI should be sensitive 
are not immediately apparent. 

Both the VSMS and the BI depended on information provided by 

the teacher or teacher aide about the child. Thus, Uiere is an additional 

2 

problem of validity in these instruments. We have no independent 
criterion or measure of validity of teachers* ratings of children on BI 
scales. We do know that many teachers found the BI difficult to com- 
plete, and that aome did so hastily, while others deliberated each scale 
at length. In VSMS interviews, the teeters sometimes found teachers 
indicating that the children were much alike in abilities, or stating 
ignorance of much of the child’s capabilities. In a few cases, testers 
noticed that an individual qkild could not in fact do what the teacher had 
said that he could. However, at least in Vineland interviews testers 
were able to make some checks on the probable authenticity of the 
information. 

One indication, however, that the BI may be somewhat sensitive 
to actual behavior of the children (as seen, interpreted, converted, and 

i 

rated by teachers), comes from an attempt to examine stereotypes. We 
asked 15 teachers of the deaf in a summer course to imagine and rate 
on BI's: 

^In interpreting the various correlations, it is necessary to bear in mind 
that Head Start children are a selected sample whose range of variation 
on a number of variables is restricted. 

2 

See Appendix D, Tests. 
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• A dii&dvantaged 5- year- old boy. 

• A non- disadvantaged 5-year- old boy. 

• A disadvantaged 5-year-old girl. 

• A non- disadvantaged 5-year- old girl. 

The mean "scores” are shown below in Exhibit 53. 

EXHIBIT 53 - MEAN BEHAVIOR INVENTORY SCORES FOR HYPO 

THETICAL CHILDREN 





N on- Di sadvantaged 


Disadvantaged 


Boy 


152.80 


109.47 


Girl 


149.53 


114.80 



Where the mean scores for the imagined disadvantaged children 
tended to favor the girls, the obtained BI scores tended to be higher 
for boys. More striking, however, is the comparison of the means 
of the hypothetical children with the obtained means for 5-year-old 
males and females (see Exhibits 21A, 21B, and 21C). The means for 
the imagined non- disadvantaged children are exactly in the range of 
obtained unadjusted means for 5 -year-olds, as the exhibits show. In 
no case did any of the S, M, or L 5-year- old groups have a BI mean 
as low as 115. This suggests that, if the hypothetical scores reflect 
stereotypes of some generality. Head Start teachers may have been 
more discriminating and rated the children more consistently with 
their actual behavior. 

A similar argument comes from a comparison of BI scores ob- 
tained in the summer 1965 Head Start research evaluation effort.^ For 
the sample obtained, the post-test mean for 5-year-olds was 127.82. 

1 — 

Planning Research Corporation, PRC R-795, Results of the Summer 

1965 Project Head Start, Volume I, Cort, H. Russell, Jr., et al, 

9 May 1966. (See Exhibits 1-14 and 1-15. ) 
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While that ie eomewhat lower than moat of the obtained BI meant for 
5*year-olda in thia atudy, it ia atill higher than the hypothetical child 
meant. 

No data bearing on the reliability of the Bl ia available. We did 
not do a aplit-half or odd-even correlation analyaia of it, nor did we 
have the opportunity to do a re-teat. A factor analyaia of the BI waa 
performed, uaing the nine aubteata at variablea; however, the reaulta 
were not informative and will not be diacuaaed here. 

The raw acorea from all inatrumenta were examined for tetter 
effecta. The hypotheaia that the tetter affecta the acore on a teat waa 
teated by a one-way analyaia of variance of individual teat raw acorea 
for each of four age levela. The treatment variable waa the teater. 
Reaulta for all inatrumenta, including the DAP, are ahown in Exhibit 
54, It ia apparent diat the BI waa the only teat with no evidence of a 
teater effect, aa would be expected. Thia ia not, of courae, poaitive 
evidence of the validity or appropriate aenaitivity of the BI, but at leaat 
there waa no atrange negative evidence. 

The implicationa of Exhibit 54 for the other teata, and indeed for 
the findinga of the atudy, bear aome comment. Firat, doea the reault 
auggeat a reaaon for failure to detect an effect? The exhibit liata the 
code number of the teatera aaaociated with higheat and with the loweat 
meana in each cell where the F-ratio waa aignificant. Thia givea aome 
indication of the extent to which teatera were conaiatent in producing 
an effect, however it may be regarded. Aa far aa the PPVT, PSI, and 
DAP are concerned, the concluaion ia that one teater (05) appeara to 
be aaaociated with low acoring groupa. However, there are two pointa 
to be noted. Theae are that the low PPVT and PSI aamplea of 3-year- 
olda were the aame children ^and that the sample size was, in fact, two. 
Furthermore, a number of the 23 low DAP 5-year-olds associated with 
this tester were Spanish- speaking children. These appear to be suffi- 
cient reasons for disregarding the possibility that this tester was 
biasing the results substantially. 



o 
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With respect to the VSMS, there appear to be two testers asso- 
ciated with high and low scores. Sample sizes range from 17 to 49. 

The tester "effect, ” in this case, could not be on the behavior of the 
child but on the responses of the teacher. The specific distribution of 
children in these high or low groups has not been examined with re- 
spect to S, M, or L. However, Tester 04 tested in M and L centers, 
while Tester 15 tested in all three types. It seems reasonable that 
if Tester 04 was biasing results, it was in favor of a "treatment" effect, 
whereas Tester 15 would tend simply to subtract a constant from each 
level. This is speculation, of course. 

With respect to the PPVT, the high group tester tested five 3- 
year-olds in an L center. The six 4-year-olds who formed this tester's 
low group on the PSI were in the same center. The other high testers 
on the PSI tested in M centers (Tester 13), S and M centers (Tester 16), 
and M and L centers (Tester 18). Sample sizes of children range from 
2 (Tester 05) to 28. With the exception of Tester 19, in no case were 
the extreme groups the total number of children tested by a given tester. 

The overall impression is that while there is evidence for the 
interaction of testers and children,^ it does not seem systematically 
related to particular tester for the PPVT or PSI, nor to S, M, and/or 
L for any of the measures. 

Another criterion of validity of the tests is the extent to which 
they are related to other indicators or measures of performance. A 
very limited opportunity to investigate this possibility was provided by 
data from the Paid and Volunteer Worker's Evaluation Form collected 
from the staff of the centers by the Bureau of the Census. The form 
consists of a series of rating scales to be checked by the worker. One 
set of scales relates to changes in the children. The respondent is 
asked to rate as "much better," "better," "no change," "worse," or 
"much worse" the following: 

Evidence that the sex of the DAP drawing was related to the sex of the 
tester for this sample is reported by Datta, Lois -Ellin and Drake, Ann, 
"Objective Sexual Differentiation in the Drawings of Preschool Children, " 
Paper submitted to Journal of Consulting Psychology. (Mimeographed 
copies are available from Dr. Datta, 2N256, Building 10, National 
Institutes of Health, Bethesda, Md. 20014). 
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I feel that in general children attending the Operation Head 

Start program were changed in the following wayg: 

1. Getting along with other children. 

2. Self-confidence. 

3. Speaking ability. 

4. Everyday manners. 

5. Finishing v^at he starts. 

6. Doing what he's told. 

7. Interested in new things. 

8. Can do things on his own. 

9. Chances of success in kindergarten are. 

There was no way to relate forms filled out by teachers to 
children's scores except in the case of one-class centers. Therefore, 
we used for each item the average rating of all staff members of a 
CDC who submitted a form^ as the index of the center's belief about 
its children. Items were scaled from 1 to 5, with 1 representing "much 
worse." (We recognized the problems inherent in this whole procedure, 
but we believed that it was worth trying to exploit any information 
available. ) 

The center average unadjusted raw score means of each test were 
then correlated with the center average for each item. The product- 
moment coefficients are shown in Exhibit 55. The correlations are 
based on a sample size of 72. 

The results of this admittedly gross procedure are interesting, 
but, like many correlational studies, difficult to interpret. First, the 
items themselves are highly inter cor related (except for "Success in 
Kindergarten," the intercorrelations are all over 0.90). Consequently, 
the questions of interest are; 

• Are the correlations between items for a given test really 

different? 

• Are the correlation coefficients across tests for a given 

item really different? 

The questions can be examined by transforming the coefficients 
to standard normal variables and calculating the standard error of the 

A respondent was required to fill out only one form, so the items are 
impressions about the CDC children as a whole, presumably. 
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EXHIBIT 55 - CORRELATIONS OF CENTER OPINIONS AND TESTS 



Item 


PPVT 


PSI 


VSMS 


BI 


1, Getting Along 


0.289 


0.226 


0,077 


0.382 


2, Self-Confidence 


0.344 


0.290 


0.086 


0.309 


3 . Speaking Ability 


0.339 


0.27 5 


0.116 


0.348 


4. Manners 


0.416 


0.331 


0.110 


0.3 59 


5. Finishing Tasks 


0.327 


0.267 


0.112 


0.304 


6. Obedience 


0.378 


0.295 


0.138 


0.379 


7. New Interests 


0.313 


0.246 


0.088 


0.3 53 


8. Self-Reliance 


0.299 


0.251 


0.060 


0.335 


9. Kindergarten 


0.305 


0.266 


-0.039 


0.057 
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difference between two correlations. Differences on the order of 
0.30 to 0.35 are significant by this procedure. Thus, it is clear that, 
with one possible exception for the kindergarten variable, the variations 
of correlations between items for a given test are not significant. Fur- 
thermore, the differences in correlations for given items across tests 
are not significant either. Thus, this opportunity for this form of 
assessment of validity did not yield information.^ 

Another aspect of the validity of the tests is the question of the 
interaction of test and/or test situation and child. There is growing 

concern that tests such as the PPVT, for instance, may inhibit children 

2 

by the nature of the situation that they present. Thus, Cline, et al, 
suggest that the forced choice, receptive language mode of the PPVT 
is anxiety- provoking to children who see being controlled (forced choice, 
receptive language) as particularly threatening. Cline suggests that 
low-income Negro children are especially likely to show withdrawal 
and/or random behavior under those conditions, and thus to show a 
performance deficit. Therefore, a test such as the PPVT may not be 
sensitive to genuine cognitive changes by virtue of its inhibiting effect. 

We have no rigorous data bearing directly on the test anxiety 
hypothesis. The negative correlations of the PPVT with race (Exhibits 
49 and 51) are always larger than the negative correlations of the PSI 
total score and subtests scores with race. This is consistent with the 
hypothesis that the PPVT, by controlling and thus raising the anxiety 
level more than the more expressive language PSI (relatively), penalizes 

One reason for presenting this analysis, gross though it was, was to 
illustrate some problems that arise in attempting to use different types 
and sources of data for evaluational purposes. 

^Cline, M. G. , Judith Marshall, and Eunice Stansbury, "Stanford-Binet, 
PPVT, and Low Income Preschoolers: New Pitfalls for Old Tests." 
Paper presented to Eastern Psychological Association, April 1966. 

(A mimeographed copy is available from Dr. Marvin G. Cline, Institute 
for Youth Studies, Howard University, Washington, D. C. , 20001). 

^Loc. cit. 
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particularly those children sensitive to such restriction. However, 
there are other interpretations possible. 

One set of evidence comes from ratings of test behavior made by 
testers directly following administration of the PPVT (generally made 
while the child was doing the DAP). The front of the PPVT test record 
provides nine descriptors of test behavior arranged in the form of 3- 
category polar scales. Testers were not provided with criteria for 
using these scales, nor was the use of them stressed during tester 
training. Consequently, any consideration of the results of these ratings 
must be made under strong caveats concerning their reliability and va- 
lidity. In this same vein, the further confounding factor of observation 
time should also be noted; by the time that the tester rated the child, 
he had observed him reacting to the PSI as well as to the PPVT. 

With all of these qualifications, we show two types of classifications 
of the results of the ratings. We selected five variables that appeared 
to be (1) related to the test anxiety hypothesis, and (2) somewhat con- 
ducive to the development of a meaningful discrimination on the part of 
the tester. Sorting on the basis of variables to be described, we made 
a count of the number of children rated at the extreme negative end of 
each variable or item and the remainder of the children. Thus, for 
each test behavior variable or item, we determined the frequencies of 
extreme negative ratings (N^^) and non-extreme negative ratings by: 

• Race and sex 

• PPVT IQ above or below the mean IQ for Group 1 (IQ = 85). 

The percentages of children rated at the extreme negative end of an 

item scale, by race and sex, for S and L, and the two samples combined 
are shown in Exhibit 56. Percentages of children above and below the Group 
1 average IQ who were rated extremely negative, for essentially the same 
S and L samples and for the M sample as well, are shown in Exhibit 57. 

The five variables, with the extreme negative descriptor in each case, are: 



1 



Samples include non-eligible 



as well as eligible children. 
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EXHIBIT 57 - ASSOCIATION OF LOW RATINGS ON SELECTED PPVT TEST BEHAVIOR SCALES 

WITH PPVT IQ 
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• Guessing: resisted guessing. 

• Speed of response; slow. 

• Verbalization: taciturn. 

• Attention span: distractable. 

• Need for praise: much needed. 

The reader should bear in mind the varying compositions of the 
S, M, and L samples in considering these data. (They are unweighted 
percentages. ) He should also bear in mind that there is no indication 
of the extent to which the extreme negative ratings tended to be the 
products of only a few testers. It is clear that responses tended to be 
extremely negative quite a bit less than one- third of the time. 

It appears that the Negro children, disproportionately often (see 
Exhibit 2 for the total sample composition), were rated, following the 
administration of the PPVT, as having forms of behavior consistent 
with the test anxiety hypothesis. Further, these extremes of behavior 
tended to be related to lower standing on the PPVT IQ scale (used here 
for convenience to control for age). There was also some tendency for 
the proportion of children rated extremely negative or low in the total 
treatment level samples to decrease from S to L. The exception 
occurred in the "attention span" area, and here it is important to bear 
in mind that L program children were generally younger than S or M 
types. There is also an indication in Exhibit 56 that, with the exception 
of "guessh.g," Negro boys were more frequently rated low on the test 
behavior variables than Negro girls. 

Thus, there appears to be some marginally interpretable evidence 
consistent with the test anxiety hypothesis, and, indeed, consistent with 
the hypothesis that test anxiety is higher among Negro than white chil- 
dren, when other things are equal 

If it is true that testing, and especially receptive language testing, 
is particularly threatening to low-income Negro children, then it may 
be asked whether Head Start programs help such children change their 
perception and reactions to a situation that is analogous in many ways 
to the structure of public school classroom operations. The data shown 
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here suggest some tendency for a change in a positive direction, but 
a change occurring more with white than non-white children. 

There is a further, more remote, consideration bearing on 
validity in some of the data available in this study. Examination of 
individual items in the PSI, for example, reveals a number of interest- 
ing points (see Exhibit 3 5). The difficulty of most items clearly varies 
with age, although for some it does not. An example of a non-age 
sensitive item is Subtest 1, item 4 (”When is your birthday?"), which 
was quite uniformly difficult for all except two of the three L 6-year- 
olds. Similarly, item 22 in the same subtest ("Put one car in the 
middle-sized box") was not much easier for older than for younger 
children. The question of interest is: are these items difficult for 
children in the 3- to 6-year-old age range generally, or are they 
difficult for low- income children? This question cannot be addressed 
directly without comparative data from non- low- income children. 

Some of the uniformly difficult items may simply be outside the general 
vocabulary range or experience of young children. However, in light 
of the frequently reported lack of sensitivity of culturally deprived 
children to temporal distinctions, it is interesting to note the more or 
less uniformly low level of correct identification of times (Subtest 2, 
items 35, 36, and 37, having to do with identifying the time of year 
that it is (1) hottest, (2) coldest, and (3) at present). Even for 6-year- 
olds these were difficult items. It is tempting to take this difficulty 
as consistent with other findings about the lack of temporal discrimi- 
nation in poor children. 

Were the instruments used appropriate? Or were they too irrele- 
vant, unreliable, insensitive, inhibiting, or overly contaminated by 
nuisance factors to detect genuine cognitive, social, or behavioral 
changes associated with Head Start program participation? There is 
no simple yes or no answer to these questions. We have reviewed 
evidence available from our own data and offer the following comments 
as our opinions, based more on impressions than on cold analysis. 
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1. For purposes of detecting general shifts in performance 

in a situation calling for use of receptive language skills and/or willing- 
ness and ability to operate according to the demands of authority 
(teacher, tester, etc. )--that is, to play the game--the PPVT seems to be 
fairly appropriate when differences in raw score means are used as 
the measures of effects. In our study, the PPVT was generally sensi- 
tive to variables to which it ought to be sensitive. To the extent that 
it simulates one form of situation or relationship which Head Start 
children, like others, will ineluctably encounter with increasing fre- 
quency and seriousness in their public school careers as the system 
presently works, it matters little whether the changes measured by it 
are cognitive, motivational, or both. Functionally, the result is the 
same. The challenge for Head Start is to find and clarify those pro- 
cedures and techniques that maximize the development of effective 
cognitive skills, whether such techniques are directed at cognition, 
motivation, or both. It does not, in general, appear to have done so yet. 
We are not saying that other tests might not be equally or more appro- 
priate. We are saying that, other things being equal (including, inci- 
dentally, administration costs), we think that the PPVT is reasonably 
appropriate for Head Start program evaluation purposes, 

2. We think that the revised PSI is at least as appropriate for 
evaluation purposes as the PPVT, However, administration costs, 
including tester training, are higher. The PSI in many respects pro- 
vides more information that is of operational significance than does 
the PPVT. Furthermore, the PSI appears a little more sensitive to 
variables that one would expect it to be, and possibly a little less sensi- 

♦ 

tive to confounding variables. The PSI was more sensitive to the urban- 
ization measure than the PPVT (see Exhibit 45), although we did not 
examine individual items with respect to that variable. Somewhat more 
of the total PSI variance was accounted for by the independent variables 
considered in this study. Thus, the PSI is probably more sensitive to 
local conditions than the PPVT and as such makes a better instrument 
for local diagnostic purposes for children who were not extremely 
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handicapped than does the PPVT. For our purposes, it seemed no more 
appropriate than the PPVT, and it was substantially more costly to ad- 
minister and to train testers for than the PPVT. We have reservations 
about the order in which some items occur on the PSI (see Appendix D). 
We think that the present grouping of subtest items may enhance what- 
ever test anxiety is inherent in the situation for low- income children. 

In some ways, the PSI seems constructed more to accommodate the 
academic standards of test specialists than to provide interpretable infor- 
mation about cognitive content or achievement. Nevertheless, it has 
at least a face validity for evaluation of Head Start programs that the 
PPVT lacks. 

3. The Behavior Inventory remains an enigma. It told us 
little about effects (neither, of course, did other instruments). It told 
us relatively little about sensitivities to background or control variables. 
A number of teachers found many items ambiguous or hard to answer 
because of their multidimensionality. We found that the design of the 
form contributed substantially to the omission of responses by teachers 
and that a factor analysis of BI subtests gave us meaningless results. 

We found it difficult to imagine what criteria teachers used in making 
some ratings. Our overall opinion is that, before the BI is used further 
for diagnostic or evaluation purposes, systematic investigation and 
evaluation of it as an instrument should be undertaken. There is some 
evidence that it is grossly sensitive. However, there are too many 
uncertainties about what it really is measuring under conditions such 

as ours to recommend its further general use without more research 
and evaluation. 

4. The Vineland Social Maturity Scale, as employed in this 
study, seems grossly appropriate, but not worth the cost of tester 
training and test administration. The negative beta coefficients for 
center size in the stepwise regression analysis, plus the negative beta 
for the amount of teacher's pre-school experience, plus the significantly 
positive age betas (and correlations), suggest a tendency to "report" in 
terms of an age stereotype or not to report enough information. (Testers 
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sometimes reported that teachers in larger centers seemed less knowl- 
edgeable about or familiar with the children. ) The overall average 
social quotients (SQ's) obtained in this study were about 100, as they 
should have been if this had been a typical group or if teachers had 
responded in terms of a typical stereotype. We found evidence of a 
tester-teacher interaction with the VSMS, and of a tester bias. 

It is not clear that the VSMS would be sensitive to effects of Head 
Start treatment, even if the parent were the respondent. We think that 
the value of the VSMS in this study was to establish that the children in 
our sample probably were not conspicuously advanced or retarded for 
their ages in terms of the skills and abilities examined by the interview. 

b. Reliability 

The question of reliability is basically the question of 
error of measurement. It is often assessed by a variety of methods, 
including observations made at different points in time (e.g., test- 
retest), or analyses of a single set of observations (e.g., split-half 
correla^'ions or item statistics analyses). 

As stated earlier, there was no opportunity in this study to make 
any of the standard reliability checks in terms of repeated observations 
or multiple observers. We did calculate KR-EO coefficients for the 
PSI subtests, for four age groups, and for each treatment level. The 
coefficients are presented in Exhibit 58. The coefficients for Subtest 
2 are based only on items 27 through 42, since items 43 through 47 
were 2-point items. Sample sizes by age and program level are the 

same as those shown in Exhibit 3 5. 

The coefficients for the 3-year-olds and for the L program's 6- 
year-olds are based on very few observations. This seems to be re- 
flected in a greater diversity of coefficients for these groups. Other- 
wise, the coefficients seem much alike. Generally the M level coeffi- 
cients appear to be higher than those around them. 

^Kuder, G.F. andM.W. Richardson, " The Theory of the Estimation of 
Tfist Reliability. " Psychometrika , 1937, Vol. 2, No. 3, pp. 151-160. 
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EXHIBIT 58 - KR-20 COEFFICIENTS FOR PSI SUBTESTS 



Sub test 


Age 


Program Duration 


Short 


Medium 


Long 




3 


0.98 


0.76 


0.72 




4 


0.74 


0.86 


0.73 


1 


5 


0.78 


0.82 


0.77 




6 


0.73 


0.87 


0.57 




3 


0.53 


0.62 


0.67 


o(l) 


4 


0.74 


0.72 


0.81 


L 


5 


0.73 


0.80 


0.7 0 




6 


0.78 


0.82 


0.79 




3 




0.15 


0.60 




4 


0.60 


0.68 


0.63 


3 


5 


0.76 


0.7 5 


0.75 




6 


0.73 


0.82 


0.61 




3 


0.67 


0.73 


0.72 


A 


4 


0.67 


0.78 


0.67 


4 


5 


0.75 


0.80 


0.67 




6 


0.70 


0.72 


- 



Note: (1) Based on items 27 through 42 only 
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By this index the reliability of the PSI subtests appears relatively 
satisfactory. 

For the main part of the study, our concern has been with reduc- 
tions of total variances. Test reliability or unreliability is simply one 
factor contributing to an unknown extent to our error variance. We 
believe that we have specified fairly the reliability of the means, differ- 
ences, and weights obtained, but there is no way of relating test reli- 
ability sums to the sums in those models. 

5, Effects of Head Start Which May Not Be Immediately 
Noticeable 

If there are latent effects, by definition we would not have 
detected them with end testing. There is no question that longitudinal 
or cohort- tracking studies are vitally needed for major social programs 
and should, in the long run, provide more reliable and interpretable 
information than short-term studies such as ours. The dilemma, from 
a program point of view, is contained in the phrase "in the long run," 
Short-term studies have the most potential for influencing practical 
correction of discrepancies or inadequacies before procedures, opera- 
tions, practices, attitudes, and so on become institutionalized. On the 
other hand, it appears that results from short-term studies are the 
more difficult to interpret both on technical and theoretical grounds. 

We are inclined to think that the trouble with short-term studies is just 
that; they are simply short-term studies. To the extent that they pro- 
vide a base for continued observation, their value should be enhanced 
substantially. 

We conclude that there are a number of reasons why we may have 
failed to detect social or psychological effects of Head Start programs 
in our samples. We have discussed what seem likely to be the principal 
ones. For us, the overriding constraint is the lack of information about 
local program objectives and goals. Without better definitions, it ap- 
pears that evaluation may continue to be a marginally informative 
operation. 
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C, Exceptions 

There were two exceptions to the trend of results that warrant 
comment. One was the significantly low adjusted BI mean for the M 
treatment level (Exhibit 39). The other was the significant beta coeffi- 
cient for "weeks" for PPVT and PSI equations for Group 5, children 
tested in Spanish (Exhibit 45). We shall comment on these in turn. 

1. The Behavior Inventory 

There are several choices of explanation for the result, 
depending upon the assumptions accepted. Basically, the result occurred 
because the model applied essentially the same type of correction to the 
BI that it did to each of the other measures: it lowered the M mean. In 
the case of the BI, it did so to compensate for the higher level of every 
covariate except population, modified by the correlation of the BI with 
those variables in the overall sample, and by their intercorrelation. It 
is apparent that the large decrement for the M level BI was associated 
with race (there were disproportionately more whites in the M sample 
than in the other two). Decrements beyond that, in diminishing order 
of magnitude, were contributed by age, poverty, and sex. 

If we examine the BI means, adjusted and unadjusted, when age, 
sex, and race are held constant (Exhibit 43), we find much the same re- 
sult with two conspicuous exceptions: the M level means are adjusted 
upwards for the 5-year- old white males and females. The adjusted 
means show the same V configuration over SML in seven of the 10 groups 
as the By-Center means, ^ as compared with eight of the 10 for the VSMS, 
and five of 10 for the PPVT and PSI. Thus, it appears that especially 
where the two teacher-related instruments are concerned, the V configu- 
ration is not just a correction for age, sex, and race inequalities. 

In the By- Center regression analysis, the independent variables 
accounted for about 9 percent of the total BI variance, less than for any 



It will be recalled that none of the F- ratios for the By- Child analyses 
was significant at the 0.05 level of confidence. 
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of the other measures. In the By- Child stepwise regression analyses, 
the 13 independent variables used accounted for a little over 8 percent 
of the variance for Group 3, with age and population having the only 
significant betas. 

One hypothesis to account for these results is based on the as- 
sumptions that the BI provides an accurate index of behavior and that 
the S, M, and L groups were comparable in behavior by this index at 
the start of their programs. Given these, it would appear that there 
is, over time, a shift in the behavior of Head Start children away from 
forms valued by or acceptable to teachers, followed by a readjustment 
or trend toward desired forms. Thus, the hypothesis is that Head 
Start encourages the occurrence of a behavioral change sequence which 
tends to be (as well as to appear) undesirable or destructive, but which 
ultimately moves toward positive values. This concept is analogous, 
in some respects, to the transference sequence expected by many 
psychotherapists. 

A second hypothesis taices account of one further bit of data: the 
positive correlation of the BI scores with population. This hypothesis 
is that there is some factor related to the size of community that is 
related either to the behavior of the children, or to the subjective 
criteria used by teachers in making the ratings. There could be some 
form of urban/rural difference in teachers' standard or criteria, or 
in the behavior of children, or both. PRC has not, however, examined 
the slopes of the regression on population for S, M, and L to determine 
whether they were in fact dissimilar. This needs to be done before 
proceeding with further hypothesizing about factors associated with dif- 
ferences in community size, 

A final hypothesis is that we were unlucky. The F- ratio for the 
BI was significant, but just barely. The exact probability was 0,04. 
Whatever variables exerted significant control over ratings on the BI 
were not the ones for which we accounted. If the F- ratio had been just 
slightly smaller, and thus not significant at (arbitrarily) the 0,05 level, 
we would have thought no more about it. Uncertainties about the BI 
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have already been discussed in subsection V. B, 4. Until more is known 
about the properties of the BI as an instrument, further speculation or 
hypothesizing does not seem worthwhile. In any event, we cannot 
reasonably conclude that the V configuration of the BI means for S, M, 
and L samples points to a program or treatment effect, 

2, Children Tested in Spanish 

The results of the By- Child stepwise regression analyses 
for Group 5, the children tested in Spanish, contrasted markedly with 
the results for other groups (see Exhibit 45), Specifically, for the re- 
gression equations for the PPVT and the PSI total score, the beta 
weights for weeks and for hours were significant. Yet more striking, 
the coefficient for weeks was in both cases negative (the longer the time 
in program, the lower the score). More striking still was the fact that 
no other independent variables had significant betas. Finally, the total 
proportions of variance accounted for were higher for both tests with 
this group than for any other group (over 55 percent). 

Note also in Exhibit 45 that this group had the highest average 
age (72,67 months) of any group. 

On the face of it, the results seem to suggest that the longer (in 
terms of weeks of program) that Spanish- speaking children stay in 
Head Start programs, the lower they score on the PPVT and PSI. How- 
ever, the longer (in terms of hours of daily program) that they stay at a 
Head Start CDC each day, the higher they score. 

It must be noted that the Spanish versions of both tests were not 
standardized (see Appendix D for protocols and Spanish translations 
used). 

No clear hypotheses emerge from the data. Forty-four percent 
of the PPVT variance was accounted for by weeks, age, sex, and 
population, entering the analysis in that order. Hours next accounted 
for an additional 6 percent, income for the next 3 percent, and family 
size and center size for the remaining 2 percent. With respect to 
the PSI, 47 percent of the variance was accounted for by weeks, age. 
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and family size (relatively high negative partial correlation). The 
next 8 percent was accounted for by mother’s education (1-1/2 percent), 
hours (1-1/2 percent), and income (5 percent). Center size, population, 
and sex, in that order, account for the remaining 3 percent. 

The partial correlations of the independent variables with the two 
tests are notably different, in some respects. They are shown in Ex- 
hibit 59. 

EXHIBIT 59 - PARTIAL CORRELATIONS FOR SPANISH- SPEAKING 

GROUP 5 



Independent 

Variable 


Dependent Variable 


PPVT 


PSI 


1. Age 


0,18 


0,16 


2, Sex 


-0.33 


0,15 


3, Weeks 


-0,37 


-0,37 


4, Hours 


0,39 


0,37 


5, Mother's Education 


0.00 


-0,23 


6, Population 


0.32 


0.25 


7. Center Size 


0.10 


0.28 


8, Family Income 


0,26 


0,24 


9. Family Size 


0,15 


-0,30 



It is easy to suspect a selection factor operating across the S, 

M, and L Spanish- speaking samples. However, there is no way of 
demonstrating it. Furthermore, the partial correlation coefficients, 
along with the variables accounting for the variance of the PPVT and 
of the PSI, suggest two quite different mechanisms operating with the 
two tests. 

Whatever the explanatory hypotheses, it is clear that this sample 
of Spanish- speaking children is substantially behind the other samples 
on the two individual tests, and on the VSMS as well, (This is true 
despite the fact that they were presumably given an advantage in lan- 
guage for testing purposes which they do not enjoy in school. ) 
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VI. CONCLUSIONS 



A large amount of data has been presented and many questions 
raised and discussed. Yet further data and discussions are furnished in 
appendixes to this report. This section will present the primary conclu- 
sions that stand out in the context of the overall study. No attempt will 
be made to list all subsidiary conclusions. 

1. Subject to limitations in interpretation imposed by the design 
of the study, there was no statistically reliable evidence of a change in 
performance or rating of children in the major eligible samples on four 
test instruments which could be related to the length of a Head Start 
program or to the length of time that a child had attended a Head Start 
center. The conclusion also holds for various subsamples of children 

of similar age, sex, and race. However, the conclusion loses opera- 
tional significance for subgroups as sample sizes decrease, since only 
very large changes can be assessed reliably when samples are very small. 

The one significant variation in the test means between duration 
levels occurred with the Behavior Inventory. Various interpretations or 
explanations of the deviation were considered. PRC concluded that the 
variation was probably related to error of measurement and not to effect 
of the programs. 

2. This conclusion does not vitiate the following hypotheses or 
possibilities concerning 1966 full-year Head Start programs: 

• Children improved in many ways not measured by the tests, 
including health, nutritional status, and attitude toward 
schools and teachers. 

• Children improved measurably on the dimensions measured 
by the tests or assessed by the rating or interview scales 
relatively early in their participation in programs. 

• Parents, teachers, other staff members, and community 
organizations benefited from participation or involvement in 
the 1966 full-year programs. 

• Beneficial effects of participation are latent and will become 
manifest after the children enter school., 
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None of these possibilities could be examined within the context of this 
study. 

3. There were a number of factors which could have acted to 
obscure the observation and measurement of a Head Start treatment 
effect. Of these, the more significant methodologically appear to be: 

• Lack of direct evidence that major experimental samples 
were comparable at the start of the programs. 

• Some uncertainty concerning the validity and reliability of 
at least one of the measuring instruments. 

• Lack of specificity of information about needs and goals 
associated with different programs. 

The first is by far the most serious. As a result of the criteria 
used to identify the programs in the three duration levels studied, the 
distribution of centers was quite unlike any usual geographic distribution. 
Whether there were underlying selective factors differentially associated 
with the emergence of funded programs at different points in time during 
fiscal year 1966 is a matter of speculation. 

4. The concept of an effect is complex and deserves close 
attention in the evaluation of large-scale programs aimed at changing 
behavior. 

5. There is some evidence that the generally lower performance 
of Negro children relative to white children, especially as measured by 
the PPVT raw score, may be the result of a motivational rather than 

(or as well as) a cognitive factor. The situation that may cause the 
depressed scores is analogous to the situation and demands of the school 
classroom. If this is the case, it is certainly a condition which Head 
Start programs should be trying to correct. 

6. Families of the children in the study were very similar in a 
number of characteristics in the different program levels. The 
characteristics of staff members and staff structures were generally 
similar for the three main samples, although some differences were 
noted. On a very gross basis of measurement, no significant relationship 
was observed (with one exception) between test scores and the amount of 
teachers* experience with pre-schoolers or with children from conditions 
of poverty. 
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In fummaryy this itudy conducted end-of-program teiti of samplei 
of children in 72 1966 full-year Head Start Child Development Centers 
repreaonting program! of three main length! or duration!. The eample 
of program! ranged from 6 to 36 week! in length at time of teeting, 
and the average length! of the three main ! ample! were 12.4, 19.3, and 
27.6 week!. A !ample of children in each center wa! teeted with the 
Peabody Picture Vocabulary Teat (PPVT), a teat of general verbal 
ability, and the Caldwell -Soule Pre-School Inventory (PSI), which ia 
de.igned to mea.ure performance in several areas of social and cognitive 
achievement. Teacher! were interviewed to provide rating! of the 
children on the Vineland Scale of Social Maturity (VSMS), and teacher a 
alao completed rating! of the children on the Operation Head Start 
Behavior Inventory (BI). The average test acorea of the children in 
ahort-term center! (6 to 15 weeks) provide the basis of comparison for 
the examination of effects of Head Start programs on the children. 

Statistical analyses of the results were undertaken in which a 
number of background variables such as age, sex, race, size of town, 
etc., were taken into consideration. As noted above, the overall 
result was that there was no significant indication of a general increase 
of scores with length of program. 

The study did not examine the content or structure of the programs 
in the sample. Nor was any systematic attempt made to rate or evaluate 
the quality of the programs, personnel, or operations independently of 
the test scores. Consequently, PRC does not feel that the results 
, mean either that the programs accomplished nothing, or that many 
possible short- and long-term benefits to children, parents, and staff 
members did not occur. There are numerous variables involved in an 
enterprise as complex as Head Start, and the possible impacts and 
benefits are manifold. With respect to the functions, processes, or 
skills of children presumably assessed by the instruments used in this 
study, it appears that, overall, the gains to be expected with the longer 
programs exemplified by the 1966 full-year sample studied are small. 

The challenge, in the continued evaluation of programs such as Head 
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Start, it to discover yet more precise, and at the same time comprehen- 
sive, means of depicting the true nature of the total array of benefits, 
and to translate such findings in further improvements in program 
design and operations. 
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APPENDIX A 



SAMPLES 



In this appendix the subuniverses and eventual S, M, and L sam- 
ple centers are listed. Exhibits A-1, A-2, and A-3 list the universes 
of CDC's for S, M, and L samples. The 173 centers are listed by 
grant and center number, local address, and city and state. In addi- 
tion, for each program the early estimates of number of classes and 
of the length of the program are given. Finally, the cumulative weight 
for each center in the ordering of the subuniverse is listed, as well as 
the random start for each subuniverse. 

Exhibit A -4 lists the S, M, and L centers eventually tested. Cen- 
ters are identified by grant and center number and city and state. 



EXHIBIT A-1 LOST OF ELIGIBLE SHORT-TERM CENSUS SAMPLE CENTERS FROM WHICH THE 

PRC SUBSAMPLE WAS DRAWN 
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( 3 ) Arithmetic error. 



PRC R-886 
B-1 



APPENDIX B 

STATISTICAL MODELS AND ANALYSES 



A, Introduction 

This appendix states the models underlying PRC's analyses and 
the reasons why it is felt that they are applicable. Subsection B deals 
with the analyses of covariance. One of these is far and away the most 
important of the analyses and an attempt is made to give an idea of its 
power. The adequacy of linear models is addressed in subsection C 
(dealing with regression), rather than in subsection B. Subsection D 
explains some of the peripheral analyses. Cutting across the three 
sections is the important subject of the center (cluster) effect. 

B. The Analyses of Covariance 

The analysis most informative about effects of S, M, and L is the 
By- Center analysis of covariance; indeed, the study was designed for 
this sort of analysis. Center averages were used as observations and 
concomitant variables. Because centers were clusters in the sampling, 
the scores of children within a center are correlated, while center 
averages are independent. Moreover, least squares methods are rel- 
atively insensitive to inequality of variances, so the differing numbers 
of children in the center averages should do little harm. Thus, in 

term.s of closeness to a useful least squares m.odel, center averages 
are good. 

Two disadvantages of the center average model should be men- 
tioned. Comments about the first, loss of power, are vexed by the 
question, relative to what? There is a center component in the error 
sum of squares; if this could be estimated and removed, PRC's test 
would be more powerful. (It should be noted that the corrections for 
race, etc. probably considerably reduce what might otherwise be 
taken for a center component. ) 
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The lecond die&dv&ntage is that it it somewhat harder to give 
precise meaning to the model's parameters than it desirable because 
centers were chosen with probability proportional to the number of 
classes in the center. For example, the overall mean is not that of a 
center chosen at random but, perhaps more relevantly, an approxima- 
tion to that of a child chosen at random. There is no doubt, however, 
about the relevance of the model's parameters, and of the size aind 
direction of their estimates which PRC would have liked to have seen. 



for the ith center of type j ( i = 1, . . . , n. ; j =S, M, L) . Assume that 

J 



where p s a 4 x 5 matrix of coefficients 

X^. s the vector of the five concomitant variables (age, sex, 

^ race, population, and poverty) 

X s the overall average of the Xij 

This model differs from a composite of four univariate models only in 
the allowance for non-zero off-diagonal elements of L --i.e., for 
correlation between the test scores. 

A ratio of determinants called Wilks' X yields the likelihood 
ratio test of the hypothesis 



(It should be noted that, since the |i's are 4-vectors, the alternatives 
to H are complex. ) Lambda has a distribution that in this case (two 
treatment degrees of freedom) can be exactly transformed to an F- 
distribution. 





and 
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The estimatee of all parameters of the model except those in- 
volving the off-diagonal elements of Z are precisely those of the indi- 
vidual univariate models. As a result, the four F- ratios are available, 
if somewhat hard to interpret simultaneously. Corresponding to any 
one of the F- ratios are the Scheff^ confidence intervals with a coeffi- 
cient applying simultaneously to all contrasts. 

The confidence intervale for the treatment contrasts give a fair 
idea of the power of one of the univariate testa. More directly, the 
power of a univariate analysis of covariance can be approximated by 
noting that the ti bear the same relation to times the non- centrality 
parameter 6 that their estimates, the adjusted treatment means, 
bear to the treatment sum of squares. The treatment sum of squares. ' 
which is never larger than the sum of squared deviations of the adjusted 
teeatment means from the overall mean, is 79, 83, and 59 percent of 
It for the PPVT, PSI, and BI, respectively. As an approximation, 
then, to avoid the laborious recalculation of the non- centrality param- 
eter for each set of putative n^'s , we shall assume that 0^6^ is always 70 
percent of the adjusted treatment mean sum of squares. (It should be 
noted that the above percentages change partly because the differences 

between the adjusted means change; in fact, PRC investigated power 
for only one test, the PPVT). 

We take as O’ its estimate, 19s2; because the difference between 
and infinite degrees of freedom in the denominator has small effect 
on the power charts, the non- central chi-square tables may be used. 

For the configuration of n^'s we take |J^ = |ig + A and |i = |i + A. 

The results are shown in Exhibit B-1. M S IT 

EXHIBIT B-1 POWER OF PPVT F-TEST 



A 


Sum of 
Squares 


Approximate 
Non- Centrality 
Parameter 


Power of a 
5% Test 


2 

4 

6 


43 

172 

387 


1.57 

6.27 

14.10 


0.19 

0.60 

0.93 
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Thui it can be leen that a 2-point PPVT ipread would probably not 
have been picked up, while a 4-point epread probably would have. 

The By- Child analyiee of covariance have explicitly the same 
model ai the By-Center analyiis above. Of course, p is now 4x4, 
and the four concomitant variables are population, income, family 
size, and hours per day. Sex, age, and race specify the analysis. 

Of course, there are correlations between children of the same center, 
and the effect of ignoring these correlations in the By- Child analyses 
is simply not known. The alternatives, however, would have been 
either to destroy (in some other way) the orthogonality of the design or 
fractionate and discard data to the point where cell sizes were minute 
and degrees of freedom squandered on hypotheses of no interest. Given 
the desirability of doing some By- Child analyses, the ones performed 
seem as good as possible. 

Again the confidence intervals give a fair idea of the power of the 
tests with the largest and smallest numbers of degrees of freedom. 
However, with respect to interpretability, there is the question of 
simultaneity. To the extent that the ten X's are correlated, we have 
a good idea of how: they are repeated observations of the same thing. 
Therefore, the assumption that the X's are independent will err on the 
side of overestimating significance. Under the independence assumption, 
the estimated significance levels can be combined by taking minus two 
times their logarithm and adding. The result, shown in Exhibit B-2, 
is 29.7 8, which is the 92nd percent point of the chi-square distribution 
with 20 degrees of freedom--i. e. , significant at the 8 percent level. 

In view of the correlation, we must lean toward the conclusion that the 
one significant X is happenstance. The yet more muddled question of 
the significance of the 40 F- ratios is obviated by the fact that none of 
them is significantly large--!, e. , at the 5 percent level. 

C, Regression 

A By- Center multivariate multiple regression was done with ex- 
posure in weeks as one of the independent variables. There were 



EXHIBIT B-2 COMBINATION OF THE LAMBDA'S 
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Number of 
Analyiia 


Lambda 


Total Number 
of Children 
in S, M, and L 


Significance 
Level 6 


-2 log^ 


1 


0.2970 


18 


0.2000 


3.22 


2 


0.0006 


12 


0.0060 


10.24 


3 


0.5484 


33 


0.0900 


4.82 


4 


0.9953 


87 


0.9997 


0.00 


5 


0.6417 


32 


0.2500 


2.78 


6 


0.9496 


68 


0.9400 


0.12 


7 


0.7564 


50 


0.1900 


3.32 


8 


0.6977 


42 


0.1700 


3.54 


9 


0.8737 


55 


0.6400 


0.88 


10 


0.8689 


51 


0.6500 


0.86 


Total 


29.78 
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two main reasont for doing thii. Fint, it was a way of uiing in an 
analysis the data from the children who were "ineligible" for the analy- 
sis of covariance* Second^ PHC believed that there would be a value 
in being able to examine the beta coefficients of the independent vari- 
ables taken together. 

If Uj is the 4- vector of average scores from the ith center, 
the model is 



where p - a 4 x 6 matrix of coefficients 

s the center's vector of concomitant variables (the same as 
those of the analysis of covariance, although calculated 
including the ineligible children, plus exposure. ) 

Again, the allowance for non-zero off-diagonal elements in E is the 
only difference from the usual four univariate models; the parameter 
estimates are the same with the addition of estimates of the intertest 
covariances (correlations) about, of course, the assumed means shown 



PRC assessed the adequacy of the univariate regression models 
and, indirectly, the adequacy of the similarly linear analysis of co- 
variance models. The four sets of residuals were obtained. The PPV' 
residuals were plotted against the fitted values and the reassuring 
result is shown as Exhibit B-3. Exhibit B-4 details a comparison of 
the range of each of the four sets of residuals divided by the estimate 
of the appropriate standard deviation with the 10 percent and 0.5 per- 
cent points of the (only approximately relevant) distribution of the ratio 
of range to standard deviation in a single sample of 63. The PPVT, 




and 



V (Uj) = Z 



above 
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EXHIBIT B-3 PPVT RESIDUALS SCATTER PLOT 



PRC R-886 
B-8 



PSI, and BI residuals are well within bounds, but there is an outlier 
among the VSMS residuals (probably only one), fur the replacement of 
the smallest residual by the next smallest would have reduced the 
normalized range to 3,96 (mutatis non mutandis). 

EXHIBIT B-4 RANGE OF RESIDUALS 



Test 


Normalized 

Range 


Upper 
10% Point 


Upper 
0.5% Point 


PPVT 


4.42 






PSI 


4.21 






BI 


5.33 


5.32 


6.13 


VSMS 


7.03 







It is unlikely that the results of the analyses would be much 
changed under a scheme for coping with the one big negative VSMS 
residual. 

The details of the failure of either an age- squared variable or 
a sex- times- race variable to improve significantly the PPVT fit appear 
in Section IV. 

By- Child stepwise regression analyses were done for each of the 
four test scores. Since the program printed out all possible product- 
moment correlations, it was possible to estimate the intertest corre- 
lations about the linear mean. There was considerable selection of 
inputs to even the largest stepwise regression analysis, since only 
those eligible English- tested children (517 out of 764) who gave the 
Bureau of the Census information about income, family size, and 
mother's education could enter. However, their average scores and 
demographic characteristics look much like the full sample's. The two 
groups requiring information on father's education are, of course, 
different from the population as a whole, but not startlingly unlike each 
other. 



o 
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The stepwise regression analysis is a series of regressions, 
each of which includes one more independent variable in addition to 
those of its predecessor. After the first step, the additional variable 
chosen is the one that yields the largest studentized coefficient (T -value) 
in combination with the variables of the previous step. Exposure was 
always entered first--i. e. , the first step was always the simple re- 
gression of score on exposure. Once entered, variables could not 
drop out at a later step. As a result, the effect of entering exposure 
at any step could be inferred. This procedure did not, of course, 
yield the combination of independent variables such that the addition of 
exposure would have produced the largest possible gain in proportion 
of variance explained. Indeed, short of exhaustion, there is no proce- 
dure. From the results, it is hard to believe that the stated combination 
would not change radically from sample to sample, but from these poor 
substitutes for a design it is difficult to malce an inference of even the 
slightest delicacy. 



D. Other 



Means for various subgroups were calculated independently of the 

above analyses and reported along with a standard error (SEM). This 

SEM was calculated as for the overall average in a one-way random 

effects analysis of variance, where the random effect is the center 

effect. More precisely, for any attribute let be the measure of it 

for the ith child in center j(i=l, ..., n.;j = l, k). Let 

3 

N = S n. . We assume . 

j J 



X. . = li + a. + e. . 

J ij 



where the center effects a. and the errors e.. are independent among 

J J 

and between themselves. Assume common center and error variances 
and name them as follows: 



o 
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= varja.} 

fff = Var /e. .1 
® I iJ/ 

Then, the variance of the mean X is given by 




2 2 

Since unbiased estimates of and cr^ are given by 




(K-1)(MS^ - MSJ 



N - 



2nf 



and 





where and ^-re the between- and within- mean squares, 

respectively, from the analysis of variance table, a reasonable esti- 
mate of Var I is obtained by replacing and in the state- 
ment of the variance of the mean given above by their hatted counter- 
parts. The reported SEM's are the square roots of the estimates of 
^Var "IXl so obtained. In those rare cases where the component 
was so negative that the estimate of Var Ixj* would have been negative, the 
CTj^ component was taken as zero. 

For each of ten centers we calculated the F- ratios of between- 
. class to within-class mean square errors for the four scores (see 
Appendix E, subsection C), None of the PPVT ratios, for example, 

4.1.^ 5 percent level. To get some idea of what kind 
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of PPVT differences might have yielded a significant F, it should be 
noted that, for an 80 percent chance of at least one of ten tests to be 
significant, each test must have a power of 15 percent. We shall solve 
for the sort of mean difference needed to give 15 percent power. 

Five of the centers had four classes, while five had two classes. 
Assume that the means of the 4- class centers are equally spaced over 
an interval and that three scores are observed from each of the 
four classes. Then, for a power of 15 percent, A^ must be about 
1.2 a . If we assume that seven scores are observed from each of the 
classes in the 2-class centers, a 5 percent F-test with power 15 per- 
cent is obtained when the mean difference ^ is about 0.57 a. 

The average of the ten mean square errors is 114, which suggests 
taking a = 10.7. Thus, if the 4-class PPVT means were spread over 
12.8 points within each center and the 2-class PPVT mieans were 6.1 
points apart, we should have had a significant F with a probability of 0.8. 
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APPENDIX C 

TESTERS: SELECTION, TRAINING, 
AND SUPERVISION 



A. Selection 

Candidates for Head Start testers were sought from a variety of 
sources, including colleges and universities in the Washington, D.C., 
Boston, and Philadelphia areas; the Peace Corps Placement Office; the 
United States Employment Service Professional Placement Office; and 
Planning Research Corporation (PRC) personnel. The people who were 
especially helpful in suggesting candidates were Mr. William Campbell 
and Mrs. Gladys George (United States Employment Service), Mrs. 

Leon Sharpe and Mrs. James Russell (Washington Opportunities for 
Women), Mr. Robert Calvert (Peace Corps), Dr. Philip Porter (Har- 
vard University), and Mrs. Louise Crenshaw (Bryn Mawr College). 

Dr. Porter actually recruited and interviewed candidates in the Boston 
area, and Mrs. Crenshaw scheduled interviews at Bryn Mawr College 
so that a PRC representative could personally interview 27 young women 
at the college during the course of one day. Mr. Calvert prepared a 
memorandum and distributed copies to returned Peace Corps volunteers. 
The Head Start staff at PRC prepared a memorandum describing the 

necessary qualifications for testers which was distributed to PRC 
personnel. 

Eventually, 213 candidates were personally interviewed for posi- 
tions as Head Start testers, and 25 were selected to undergo the inten- 
sive tester training programs. Of the 25 who participated in the two 
training programs, 19 actually tested Head Start children; 3 of the 25 
were "selected out" after the training program; and the remaining 3 

were held in reserve as substitute testers, but there was no opportunity 
for them to test. 

The following qualifications were considered to be of prime im- 
portance in the selection of the testers: 
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• A college degree in an area related to education, psychology, 
sociology, or guidance: 

• Experience with disadvantaged children: 

• Language fluency in Spanish as well as English: 

• Flexible summer schedule, with no travel restrictions: 

• Successful completion of the tester training program con- 
ducted by Planning Research Corporation: 

• Favorable recommendation by a college faculty member. 

Exhibit C-1 (page C-7) lists the testers who actually tested and 

presents some descriptive data on each. It should be noted that all 
testers submitted recommendations from faculty members, participated 
in an intensive training program (described below), and were super- 
vised periodically during their testing in the various Head Start Child 
Development Centers. 

B. Training 

Two training programs were conducted, because some of the 
testers who were completing the school year were not available for a 
mid -May program. However, since the testing of children in six cen- 
ters was scheduled to begin on May 16, at least six testers had to be 
available and trained by that date. The first tester training program was 
held May 12-15, 1966, at the Center of Adult Education, University of 
Maryland. Fifteen tester-trainees attended, including two permanent 
PRC personnel who were trained as a reserve or emergency test corps. 
The second training program, also at the Center of Adult Education, was 
held June 8-11, with twelve participants. 

Training was provided not only in the proper administration of the 
various tests (of which the Vineland Social Maturity Scale interview was 
by far the most demanding), but also in sampling procedures, adult- 
adult relationships, test scoring, and general operational procedures to 
be followed. The scope and intensiveness of the training programs may 
be seen in Exhibits C-2 and C-3 (pages C-8 through C-12), which list the 
schedules followed in the two programs. 
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In addition to the formal training sessions, testers were provided 
with a handbook containing deta'^^ed statements of procedures and other 
instructions for mnemonic purposes. The sections on test administra- 
tion and test scoring are reproduced in Exhibit C-4 (pages C-13 through 
C-19) to give the reader a detailed account of PRC's approach to testing 
and to make clear the rules by which the various tests were scored. 

C Supervision 

Each tester was observed at least once during his testing. Testers 
who tected at many centers were, of course, observed more frequently. 
In fact, one or two PRC supervisors were in the field during all weeks 
of testing. In addition, PRC maintained a 24 -hour telephone answering 
service expressly for testers, so that any problem could be handled by 
the PRC staff in Washington at any time. The testers were observed 
as they tested children and interviewed teachers. Usually, the super- 
visor sat behind the child or interviewee to minimize interference with 
the testing. Supervisors took notes during test sessions but, of course, 
never interrupted a session with a comment of any kind. In a private 
conference following the testing and interviewing, the supervisor dis- 
cussed with the tester any areas needing improvement. Each supervisor 
left a memorandum of each conference with the tester, as a reminder. 

Some of the kinds of statements, suggestions, and criticisms 
made by supervisors to testers in the field about their work are pre- 
sented below. 



Behavior Inventory 

1. Examine carefully to see that every line has one check only. 

2. Fill in the current week of operation of the center. 

3. Fill in under "examiner" the name of the person who is filling 
out the BI--not the tester's name. 

4. Put your own tester ID number in upper lefthand corner. 

5. Write out child's name in space provided. 

6. Put total BI score in space marked "school." 

7. There should be 25 positive scores and 25 negative scores. 
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8. Fill in each child's 11 -digit identification number in spaces 
provided. 

9. Use the new cut-out score sheets to score and check. 

Preschool Inventory 

1. Use the PSI manual to ask every question. Remember 
"OVERCONFIDENCE KILLS. " Do n^t deviate in any way 
from the procedures described and explained in the manual. 

2. In the lines provided next to questions 43-47, fill in the 
substance of the child's answer. 

3. Once the child has made one answer to a question, do not 
probe, no matter how absurd the answer may be. Do not 
prompt child to make correct response. 

4. Unless previously specified in PSI manual, do not ask the 
questions more than once; simply say "do as I said" if the 
child seems in doubt as to what he should do. 

5. Don't "rush" a child through the test. Some children just 
need time to think, without probing or pushing. However, 
do not wait too long a time if the child becomes restless; he 
may be stalling or may not wish to answer the question. 

6. Do not repeat what the child said; you will tend to reinforce 
the response. 

7. Be sure to use random and intermittent responses of "good," 
"fine, " etc. 

8 Use phrasing as dictated by PSI manual. 

Peabody Picture Vocabulary Test 

1. Fill out the front side of the PPVT record blank as soon as 
you finish testing the child. A good time to do this is while 
you are waiting for the child to complete the DAP. 

2. Speak clearly and loudly enough so the child has every chance 
to hear the correct word. 

3. Start at about item 15 unless the child seems unusually slow 
on the PSI. Remember that a child gets easily bored if he is 
able to score in the 50's, so give him a chance by starting 



PRC R-886 
C-5 



with item 10-15, rather than 1. You can always go back to 
earlier items, as per the PPVT manual. 

Memorize the instructions from the PPVT manual so you 
can say them in a conversational tone. 

Don't count errors with your pencil. Use your eyes. 

*Score each PPVT question as you ask it; do not ask a group 

of questions and then score, as this induces another level 
of error. 

Vary your phases as the PPVT manual suggests, so that you 
are not always saying "point to. ... " 

Draw-A-Person 

Give the child a pencil with no eraser- -children this age 
love to draw something and then erase it. 

Don't ask, "Is that a person? " or "Would you like to add 
anything else? " 

Do something else (e.g., complete records on the PPVT) 
while the child is drawing. It may m.ake him nervous if you 
seem to be waiting for him to complete his drawing. 

Vineland Social Maturity Scale 

Use phrases that tend to avoid putting words in interviewee's 
mouth: 

a. Tell me how (child) does. . . 

b. Describe how. . . 

c. Does (child) do. , . 

d. Do you think (child) is able to,. . . 

e. Tell me more about how (child) does. . . 

Never ask a direct question using key phrases from the 
Vineland manual. 

Discuss the questions in the order described in the manual, 
beginning with Self Help-General and ending with Socialization. 
Let teacher "talk herself out" if she once gets started, even 
if she deviates from the original question. Her statements 
will provide background information that will give you ample 
opportunity for asking later questions. 
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5. If you have to ask the same teacher about several children, 
vary your questioning techniques and phrases so that she 
is forced to think about each child individually. Teachers 
sometimes say, i.n response to a question, "they all do that." 
Encourage her to discuss children as individuals. 
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EXHIBIT C-1 HEAD START TESTERS 
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Notes: (1) College graduate 
(2) College student 
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EXHIBIT C-2 SCHEDULE FOR FIRST HEAD START TESTER 

TRAINING PROGRAM, MAY 12-15, 1966 



Thursday, May 12 
9:00 p. m. 



Friday, May 13 

8:00-9:00 a. m. 

9:00-9:30 a. m. 

9: 30-10:30 a. m. 



10:30-11:45 a.m. 

11:45-12:45 p.m. 
12:45-1:45 p. m. 



1:45-3:00 p. m. 

3:00-3: 30 p. m. 
3:30-6:30 p. m. 

6:30-7:30 p. m. 
7:30-9:30 p. m. 

Night 



Saturday, May 14 

8:00-8:45 a. m. 

8:45-10:30 a.m. 



Check-in at Center of Adult Education, 
University of Maryland, Adelphi Road 
and University Boulevard, College 
Park, Maryland (phone: 301-779-5100) 

Free time; assembling of test kits 
Breakfast 

Check-in for local trainees 

General orientation and background 
Dr. Allen Ferguson: 

Orientation to PRC 
Dr. H. Russell Cort: 

Orientation to Head Start Project 

Mr. Charles McDaris: 

Project administration 

Lunch 

Dr. R. Ann O'Keefe: 

PPVT - introduction, manual, 
scoring, practice, questions 

Dr. Cort: 

Sampling procedures 

Coffee break 

Mr. Karl Banks: 

Vineland Social Maturity Scale - 
introduction, manual, demonstra- 
tion, scoring, discussion 

Dinner 

Dr. Lois-ellin Datta: 

Testing, with special emphasis 
on tester -child relationships 

Reading assignment: review Vineland 

manual and PSI manual; administer 
and score PPVT, PSI, BI, and Vine- 
land at least once on own 



Breakfast 

More demonstration-discussion of 
tests 
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EXHIBIT C-2 (Continued) 

10:30-11:00 a. m. 

1 1:00-11:45 a. m. 



1 1:45-12:45 p. m, 
12:45-1:00 p. m. 
1:00-3:00 p. m. 

3:00-3:30 p. m. 
3:30-4:00 p. m. 
4:00-6: 15 p. m. 



6: 15 -6; 30 p. m. 
6:30-7:30 p. m. 
7:30 p. m. 



Sunday, May 15 

9: 00-1 0: 00 a. m. 

10:00-11:00 a. m. 

1 1:00 a. m. 



Draw-A-Person - introduction, prac- 
tice, scoring, discussion 

Complete "dry run"; pair testers so 
that one is "child, " the other is tester; 
find place for testing, establish rapport, 
test, score, recheck, etc.; "child" also 
takes role as informant for Vineland 
and BI (this is all simulation, of course); 
prepare testers for arrival of children 

Lunch 

Free time 

Pair child with trainee; administration 
of PSI, PPVT, and DAP 

Debriefing 

Break 

Dr. Charles Dailey: 

A case study emphasizing adult-adult 
relationships and problems. Short 
introductory lecture; discussion of 
case study (read previously) about 
reaction by the "establishment" to an 
outsider; buzz groups; hour discussion 
role playing- -ratings (small groups); 
integration and report- -entire group 

Free time 

Dinner 

More on Vineland, including practice ; 
more on sampling procedures; more 
on special problems; general review; 
field assignments and travel arrange- 
ments 

Breakfast 

Clarification of immediate schedules 
Leave for airports as appropriate 
General dismissal 



o 
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EXHIBIT C-3 SCHEDULE FOR SECOND HEAD START TESTER 

TRAINING PROGRAM, JUNE 8-11, 1966 

; ^ 



Wednesday, June 8 

5:30-6:30 p. m. 



Thursday, June 9 

9:30-10:30 a. m. 



10:30-11:00 a. m. 
11:00-12:00 noon 



12:00-1:00 p. m. 
1:00-1:40 p. m. 

1:40-2:30 p. m. 

2:30-3: 30 p. m. 

3:30-4:00 p. m. 
4:00-6:00 p. m. 

6:00 p. m. 



Mrs. Naomi Henderson: 

Introductory meeting at PRC, 

12th floor conference room; 
hand out test packets and reading 
assignments 



Meeting at PRC, general orientation 

and background: 

Dr. Allen Ferguson 
Dr. H. Russell Cort 

Mr. Charles McDaris: 

Administration of project 

Dr. O'Keefe and Mrs. Henderson: 

Review of nontechnical details 
in PRC's Head Start testers' 
manual 

Lunch 

Dr. O'Keefe: 

Introduction to technical nature 
of work; overview of all tests 

Dr. O'Keefe: 

PPVT - manual, scoring, demon- 
stration practice, questions 

Dr. Cort and Dr. William Commins: 
Sampling procedures 

Break 

Dr. Lois -ellin Datta: 

Testing, with special emphasis 
on tester -child relationships and 
factors 

Adjourn, with study assignment: 

PPVT and manual; PSI and manual; 
Vineland manual- -focus on eight 
categories; Head Start testers' 
manual; administer and score two 
PPVT's; prepare for test on Vineland' 
eight categories 
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EXHIBIT C-3 (Continued) 



Friday, June 10 

9:00-9:30 a. m. 



9:30-11:00 a. m. 



11:30-12:00 noon 



12:00-1:00 p. m. 
1:00-3:30 p. m. 
3:30-3:50 p. m. 
3:50-4:10 p. m. 

4:10-6: 30 p. m. 

6: 30-7: 30 p. m. 
7:30 p. m. 



Saturday, June 11 

8:00-9:00 a. m. 



Check-in at Center of Adult Education, 
University of Maryland, Adelphi Road 
and University Boulevard, College 
Park, Maryland (phone 301-779-5100) 

Dr. O'Keefe: 

PSI - manual, practice, scoring, 
questions 

DAP - administration 
Dr. O'Keefe: 

Complete review of test procedures 
and forms, and preparation for 
children's arrival after lunch 

Lunch 

Test children 
Break 

Dr. O'Keefe: 

Introduction to Vineland Social 
Maturity Scale 

Mr. Karl Banks: 

Demonstration of Vineland and 
scoring; discussion 

Dinner 

Reconvene to receive study assignment: 
read case study; give and score 
Vineland twice; give PSI and PPVT 
once; mark and correct two BI's; 
simulate a DAP; prepare one or 
more complete simulation sets of 
data, organized as per testers' 
manual 

Breakfast 



9: 00-9: 30 a. m. 
9:30-12:00 noon 



12:00-1:00 p. m. 



Submit data prepared Friday night 

Dr. Charles Shaffer: 

A case study emphasizing adult 
adult relationships 

Lunch 
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EXHIBIT C-3 (Continued) 



1:00-3: 30 p. m. 



3:30-4:30 p. m. 
4:30-5:30 p. m. 
5:30-6:30 p. m. 



6:30-7:30 p. m. 
7:30 p. m. 



Test children from a local Head Start 
class: 

Administration of PSI, PPVT, 
DAP, BI (simulate), Vineland 
(simulate), and record keeping 

Score data; break 

Debrief testing experience 

Mrs. Henderson: 

Discuss record keeping 
Dr. 0*Keefe: 

Review of all test data submitted 
D inne r 

Travel assignments and completion 
of all plans 

Adjourn 
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EXHIBIT C-4 EXTRACT FROM PRC'S TESTER'S HANDBOOK 



Adminiitering Testw 

1 . General Procedures 

The Preschool Inventory will help you establish rapport with 
the child and, in any case, must be given before the PPVT. You may 
find it quite natural to start giving the PSI when you first meet the child 
by asking him his name and age. The following suggestions may be 
helpful. 

To establish Rapport With a Child ; 

(1) Say: "We are going to play a game. " 

(2) Give him some of the props (cars, checkers). 

(3) Ask him questions about the props (i.e., get 

him talking). 

(4) Use random, intermittent reinforcement for in- 

correct as well as correct responses. Statements such as "that was a 
good answer, " are good. If he makes no answer even after gentle prod- 
ding, move on to the next one, saying: "Let's try another one." 

b. To Interest a Bored or Distracted Child 

(1) Make him believe he is doing a very important 
thing by helping you. 

(2) Keep your voice interesting; do not talk in a 

monotone. 

(3) Tell him that you will be finished soon. 

(4) If he seems seriously uninterested, perhaps a 
break from testing will be necessary. Employ the principle of playing 
simple games or singing etc., to employ a "change of pace. " Return 
to the testing after 5 minutes. 

c . To Make a Child Stop Crying 

(1) Gain his confidence by offering him sympathy 
and diverting his attention from himself to you or inanimate objects. 
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EXHIBIT C-4 (Continued) 



(2) Return child to class and test him later (if 
possible) if he seems seriously disturbed about the testing situation. 

(3) Reexamine your appraoch with the children if 
many are reacting this way. 

THE TESTS MUST BE GIVEN IN THE FOLLOWING ORDER: 

2. Preschool Inventory (PSD - Used directly with child 

a. At first you will find it necessary to give the test ver- 
batim from the manual. As you become familiar with the acceptable 
answers you may use the test form alone. In any case, be sure to have 
the manual with you. It is important to phrase the questions exactly as 
stated in the manual. Do not reword a question if a child has trouble 
understanding the thought; repeat it once. 

b. The difference between a ”2" and a "1" response 
(items 43-47) depends on whether the child describes a general function 
of this person in society rather than a specific duty or job. 

c. Refrain from accenting the key word in a sentence; 

it offers verbal clues. Practice privately using the actual questions to 
train yourself away from this tendency. 

d. You may notice a tendency to become more lenient 
after testing several children, particularly if you notice that many 
children make the same kind of responses. This is very easy to do-- 
try to avoid this. 

e. Speak slowly and clearly but do not overaccent words. 
Keep your normal speaking voice throughout the testing. Avoid becoming 
"sing-song. " 

3. Peabody Picture Vocabulary Test (PPVT) - Used directly 

with the child 

a. Read the directions on page 7 of the manual to the 



child verbatim. 
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EXHIBIT C-4 (Continued) 



b. Avoid preceding a atimulua word by an article (a, an, 
the) or converting worda to plurala aa thia may provide a cue to the 
child. 

c. Alwaya aecure a reaponae from the child. Do not 
record an anawer of “no reaponae. " You may encourage him to gueaa. 

d. In order to encourage a reaponae, atart the teat with 
"point to, " “what number ia, " or "where ia, " preceding the atimulua 
word. Aa the teat proceeda you may drop the introductory phraae. 

e. After giving teat, check teat book for amudgea, and 
eraae all marka. 

f. Be aure the child' a 11 -digit identification number ia on 
the upper righthand corner of each piece of hia data. 

4. Draw-A-Peraon (DAP) - Uaed directly with child 

a. DAP ia a meaaure of mental maturity which ia baaed 
on the degree of complexity of the child' a pencil drawing of a peraon. 

b. Make child under atand that you want him to draw the 
best possible person. In your instructions simply say: "I want you to 
draw a person. " Then in answer to a question of what kind of person, 
reply: "The best one you know how, " or "Any person you want to draw. " 

c. Do not give clues as to details, such as "What about 
arms and legs? " 

d. Remember to give the child a pencil and not a crayon. 

e. Encourage the child to draw something. 

5. Vineland Social Maturity Scale - Information collected from 

the head teacher (or other staff member who knows the child 

well) via interview 

a. Learn as much as possible about the child before pro- 
ceeding with the test, such as age, schooling, general ability, and home 
environment. This will help you evaluate the information given to you. 

b. Begin questioning well below the anticipated final score. 
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EXHIBIT C-4 (Continueci) 



c. Quiz the teacher in a aympathetic manner; aak doea 
Johnny uaually do so and so rather than can he do so. 

d. After obtaining as much information as possible from 
the teacher, make YOUR OWN scoring judgment. 

e. You should present the various items and groups of 
items by category, as explained in the manual. 

f. FAMILIARIZE YOURSELF WITH THE EIGHT GEN- 
ERAL CATEGORIES AND THEIR ABBREVIATIONS. 

g. The most serious difficulty in giving this test is fail- 
ing to get sufficient detailed information to make a judgment. Do not 
become impressed with random statements such as "Johnny is such a 
sweet child, " "Sally is such a good little girl, " or "Jimmy is a hellion. 
These statements are meaningless. 

6. Behavior Inventory (BI) - Used by the teacher 

The head teacher should fill out the Behavior Inventory for 
each child tested in her class. She should give her general impression 
of the child, as there are no right or wrong ratings. 
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CXMXBIT C~4r (Oontinucd) 



Scoring Teit Formi 

1 . Preichool Inventory (PSD 

a. All items except Nos. 43-47 are scored as either cor- 
rect (1 point) or incorrect (0 points). No distinction is made between a 
wrong answer and no answer. 

b. Questions 43-47 are scored either 2 or 1, depending 
on the level of abstraction as explained in the PSI manual (page 9). 

c. Add the scores for each subtest and the total test and 
write them on the front of the form. 

d. Determine the child's age in months. 

2. Peabody Picture Vocabulary Test (PPVT) 

a. The basal score is determined by working forward on 
the test until the child makes his first error. If he has not made eight 
consecutive correct responses prior to this first error, drop back to 
the starting point and work backward until he has made a total of eight 
consecutive correct responses. 

b. The ceiling is reached when six errors have been made 
in the last eight items presented; count the last item presented as the 
ceiling. 

c. After subtracting the errors from the ceiling, the 
mental age and I. Q. can be determined by consulting the appropriate 
table in the manual. When the I.Q. is below about 55, see page 49 in 
the manual. 

3. Draw-A-Person (DAP) Test 

a. Do not score DAP. 

b. Small children often make the same kind of drawing. 

You will become familiar with some of the similar methods of repre- 
senting body characteristics. For example: small children often repre- 

sent the two arms, but they attach them to the head. If you watch care- 
fully while they are doing this, you will realize that they are arms and 
not ears . 
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EXHIBIT C-4 (Continued) 



BE SURE TO RECORD INFORMATION CONCERNING ADMINISTRATION 
TIMES OF TESTS ON PSI, PPVT, AND VINELAND. 

4. Vineland Social Maturity Scale 

a. One of five ecoree may be given for each item on the 

Vineland Social Maturity Scale: 

(1) (+) for iteme habitually performed (full credit) 

(2) (+NA) for items which the child has had no oppor- 
tunity to perform (full credit if it falls within the range of otherwise con- 
tinuous scores, no credit if it falls within range of otherwise continuous 
minus scores, and half credit if it falls within the intermediate range) 

(3) (i^ for items in the transitional or emergent 

state (half credit) 

(4) (-) for items never or rarely successfully per- 
formed (no credit) 

(5) (-NO) for items which the child has had no oppor- 
tunity to perform and probably would not be able to perform successfully 

b. The highest continuous plus score for all items is con- 
sidered the basal score; the allowance made for lack of opportunity is 
noted above. Plus credit is assumed for all items below the basal . 

c. A completed form should show at least two consecutive 

minus scores in each of the eight categories. 

d. The total score is obtained by adding to the basal score 

the additional credits scattered beyond the basal score. 

e. The total score is then converted to an age equivalent 
(social age, or SA) by checking the table on page 28 in the manual. 

f . Determine the Social Quotient by using this formula: 



Social Quotient (SQ) = 



Social Age (SA) 
Life Age (LA) 



X 100 



Note: Life age means chronological age. 
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exhibit C-4 (Continued) 



g* Finally, complete the face sheet of the record form 
by recording the information obtained (basal score, additional points, 
total score, social age- -which is called »*age equivalent" on the record 
form- -and social quotient). 

5. Behavior Inventory (BI) 

a. Be sure all identification numbers are filled in by the 
teacher for each child's inventory. 

b. If necessary, read instructions on inventory with 
teacher before she completes forms. 

c. Score the BI by using the cardboard scoring key (BI 



Corrector). 
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APPENDIX D 



Thii appendix provides additional detail on tests to supplement 
Information presented in Sections III, IV, and V. Topics covered are: 

• Subjective critiques of the primary test instruments, based 
mainly on testers' comments. 

• Spanish translations of tests. 

• Data on test administration time for PPVT and PSI. 

• Copies (English and Spanish) of the Family Information Form 
(CAP-HS Form 46) and Behavior Inventory. 

A. Discussion of Tests 

There is currently c^'nsiderable interest among educators in de- 
termining and/or developing measures of program effectiveness and 
child development- -especially measures appropriate for use with so- 
called culturally disadvantaged children. This section presents an in- 
formal discussion of tests and measures used in this study, based 
primarily on the testers' experiences and opinions. While no attempt 
was made to elicit systematic observations from testers about the tests, 
most testers did in fact make at least some written comments on spe- 
cific test items or frequently encountered test problems. This section 
is not intended as a criticism of the tests selected for use, their stand- 
ardization procedures, or their overall applicability to young culturally 
deprived children. Its purpose is simply to point out test items or re- 
quirements which frequently posed problems in testing. 

1 . Peabody Picture Vocabulary Test (PPVT) 

The testers did not encounter any significant difficulties in 
administering the PPVT.^ Once the simple pointing response was es- 
tablished, it was relatively easy to proceed through the testing and main- 
tain rapport. 

^The tester shows the child a page with four pictures and says, "Show 
me... (ball, etc,)." 
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There were, however, a few items on the test that elicited un- 
usual answers from the children frequently enough to be noted here. 

Item 15 The test stimulus word is "pulling." The correct 

picture is that of a child pulling a wagon, but children 
often selected the picture of a girl about to knock on 
a door or open it by the knob. 

Item 24 The stimulus word is "baking. " The correct picture 

is that of a girl placing a cake in an oven, but children 
frequently pointed to the picture of a baby eating. 

Item 30 The test word is ".*me" and the correct picture is a 
clock. However, many children spontaneously began 
talking about the picture of two cars crashing. 

Item 38 The test word is "barber, " but many children pointed 
to the picture of a queen on a throne. Some testers 
commented that especially non-white children in 
urban areas have been responding to a hairdresser 
who sets a woman's hair, rather than a man who 
cuts hair. 

Item 44 The stimulus word is "cash, " but many children 

seemed to hear this as "crash" and pointed to a pic- 
ture of a broken dish. 

Some testers commented that children often seemed to enjoy the 
PPVT. Since it was always administered after the PSI, which some 
children foxmd discouraging, in testers' opinions, there seemed to be 
a contrast in ease that appealed to the children. Some testers in fact 
suggested that in the future the PPVT should be given first as an ice- 
breaker. Exactly how the effects of this observation relate to the test- 
anxiety hypothesis discussed in Section V is difficult to say. 



2. The Pre-School Inventory (PSI) 

The Pre-School Inventory, developed by Dr. Bettye Caldwell 
and Mr. Donald Soule of the State University of New York, consists of 
four subtests designed to assess personal-social responsiveness. 
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associative vocabulary, and numerical and sensory concept activation.^ 
In addition to testers’ observations, the item analysis in Exhibit 35, 
Section IV, will serve as a basis for this discussion; Exhibit 35 lists, 
item by item, for both age and program duration, the percentage of 
children who responded correctly. 

Concerning the general procedures for administering the PSI, 
many testers commented that items 1 and 2, which asked the child for 
his first and last name, were troublesome because, to avoid asking the 
question twice, the tester had to be careful not to ask "What is your 
name? " while establishing rapport upon first meeting the child (even 
though it is a natural question with which to begin a conversation). 

Some testers noted that some children seemed baffled by such an ob- 
vious question. Many testers commented that item 3, which asks the 
child's age, would make a good, natural first question to ask. 

In additiOT- testers frequently noted problems stemming from the 
use of various est materials. The brightly colored plastic cars, for 
example, were often a novelty for the children, who seemed to view 
them as attractive toys. The child's main interest in many cases was 
playing, and problems often resulted when testers attempted to position 
the cars for testing purposes. Exhibit 35 shows that PSI items 19 
through 26, which used cars as part of the test procedure, proved 
especially difficult for the children. 

While checkers caused fewer problems, their use as test props 
also presented certain difficulties. Many children appeared to focus 
their attention on the design of an eagle or crown imprinted on the 
checker and could not "imagine" the checker as a train car. As Ex- 
hibit 35 indicates, on items 27 and 28, in which a line of checkers be- 
came a train, very few children of any age responded correctly. 

Although testers had no difficulty in following correct procedures 
for scoring the PSI, their comments on items 43 through 47 ^ suggest 

^Caldwell, Bettye and Donald Soule. The Pre-School Inventory, 1966. 

2 

Items 43 through 47 ask what the following people do: dentist, police- 
man, teacher, father, and mother. 
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that the rating or score assigned to a child's response may be unjustly 
low. The items themselves call for a verbal response; the child re- 
ceives two points if he makes a generalization and one point if he de- 
scribes a specific function (or functions) of the person or concept in 
question. Exhibit 35 shows that only a very few children (usually 
less than 10 percent) scored two points on any item. Testers' reports, 
however, indicated that many children did make a " generali?:ation type" of 
response, but that it deviated greatly from acceptable answers listed in 
the manual. For example, in item 46, which asks what a father does, 
few children answered with the sample responses listed in the manual: 
"Takes care of family- -earns money for family." A large number of 
children, even with a prompting, answered: "He drinks beer; he sleeps; 
he watches TV; he goes out." Such responses suggest that the child 
recognizes the concept in question. 

The test directions used for items 29 through 33^ presented con- 
siderable difficulty. A correct response required both a verbal state- 
ment and a gestural (hand-motion) response. Testers felt that more 
than one example should have been included in the test directions. 

Further, some testers felt that the verb "go," (used in the sense of 
"What is the direction of, , , ? ") may have been misleading to many 
children. For example, a tester noted that a child answering item 32 
("Which way does a phonograph record go? ") responded by humming a 
tune. When asked "How else does it go? " he responded by humming a 
different tune- -a creative response, but an "incorrect" one. 

Certain individual PSI items presented specific difficulties for the 
children, according to testers' notes. Often, a look at Exhibit 35 indi- 
cates a high correlation between individual testers' comments on spe - 
cific items and overall results by children on that item. For example, 
for item 11, the tester points to his or her own knee asking "What's 
this? " Testers commented that often the child thought that a female 
tester was pointing to her hosiery, and would reply "stocking" or "nylon. " 
In the case pf male testers, children often replied "trouser" or "trouser 



^Items 29 through 33 ask which way the following go: saw, elevator, 
ferris wheel, phonograph record, and waterfall. 
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leg." Exhibit 35 shows that item 11 does discriminate among age levels, 
but the overall percentage of correct responses is still very low. 

For item 17, which asks the child to stand up and face the door, 
problems were posed when a school or test room did not have a door. 

For example, in a school in a large urban area, there was a folding 
accordion wall rather than an actual door. In that school, none of the 
15 children tested could respond correctly. Perhaps a more body- 
oriented question such as "Stand up and put your hands on your ears" 
would fulfill the test purpose better. 

Items 19 through 26^ required use of brightly colored plastic 
cars and paper boxes. Items 22 and 25, as seen in Exhibit 35, proved 
especially difficult at all age levels. It seemed at times, according to 
the testers' comments, that the children were "snowed under" by all 
that was desired of them. Further, many testers mentioned that, be- 
cause the tester was required to return all props to their original po- 
sitions following each of those items, the children seemed to have the 
feeling that the testers were "undoing" what the children had done. 

It can be seen in Exhibits 33 and 34 that Subtest 2, which was 
designed to measure associative vocabulary, contains fewer correct 
answers than any other subtest. Items 27 and 28 ("What do we call the 
first car, the one that pulls the train? " and "What do we call the 
last car on the train? ") used checkers to represent a train, and clearly 
presented a problem for children at all age levels. The possible inter- 
ference of the checkers has already been mentioned. A second hypoth- 
esis is that children may have been unable to abstract the concept of 
train when viewing the concrete representation of the checkers. In any 
case, the level of difficulty noted in Exhibits 33 and 34 seems significant 
enough to merit further investigation. 

Cd 

Items 35 through 36 ask questions about the seasons of the year. 
Again, as Exhibit 35 points out, few children at any age level answered 



These items ask the child to place various colored cars on, under, in, 
and behind different sizes and colors of boxes. 

2 

Items 35 through 37 ask what time of year is hottest and coldest, and 
what time of year it is now. See discussion of this subject in Section V. 
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correctly. Teiteri noted that children often gave an aniwer luch ai 
"5 o'clock." Perhapi vocabulary might be atudied to determine whether 
the choice of termi used interfered witiK the children'! compreheniion 
of the test queition. 

Subteat 3, designed to measure numerical concept activiation, dis- 
criminates rather well among the different age levels. However, item 
51, which asks how many toes the child has, was very difficult, even 
for 6-year-olds. Item 56, which asks how many wheels a rowboat has, 
also received very few correct answers. 

For Subtest 4, designed to measure sensory concept activation, 
testers commented that for items 81 and 82 (which ask which of eight 
crayons is the color of the sky and which is the color of the night) many 
children explained, correctly, that the sky is white or gray (clouds, 
smog, etc. ) but these colors were not among the choices available. 

The PSI covers a variety of concepts in a simple and direct manner. 
Like any instrxunent in its early stages of development, it is not without 
problems. The item analysis in Exhibit 35 and the informal comments 
made by testers may offer suggestions for further improvement of the 
measure. 

3, Vineland Social Maturity Scale (VSMS) 

The Vineland Social Maturity Scale (VSMS) is an interview 
schedule which follows a detailed outline designed to evaluate the child's 
maturity in eight areas of development, including social and personal. 

Of all the measures, this was the most difficult to administer be- 
cause the scale is based on the assumption that the respondent will be 
a parent or someone who has actually lived with the child. However, 
testers interviewed teachers or teacher aides, many of whom admitted 
their inability to provide reliable information for certain questions. ^ 

In such cases, testers requested the respondent to make a judgment as to 

The following items frequently presented difficulty for respondents: 

No. 48 (helps at little household tasks) 

No. 64 (bathes self assisted) 

No. 65 (goes to bed unassisted) 

No. 72 (does routine household tasks) 

No. 74 (bathes self unaided) 
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whether or not the child would be "capable" of the behavior in queation. 
One tester commented that too often some responses from some respond- 
ents seemed to be a combination of conjecture, opinion, and fiction'. 
Sometimes it seemed to testers that a teacher tended to describe all 
children in her class in very similar terms. 

Testers often noted that the initial rapport established with the 
teacher or aide greatly influenced the amount of information obtained 
during the VSh4S. Testers reported that some respondents were unco- 

that their unwillingness to spend time for interviews 
definitely limited their responses made on the VSMS. Sometimes, when 
a tester found it necessary to administer several Vinelands to one teacher, 
the respondent developed a tendency either to answer all questions in a 
general manner or to rate different children rather uniformly on all items. 

4. Behavior Inventory (BI l 

Testers' comments revealed the existence of several diffi- 
culties with the Behavior Inventory (BI). Incomplete Behavior Inven- 
tories were often returned to the tester. Teachers repeatedly left items 
1, 27, 41, and 44 unchecked- -a fact which strongly suggests that these 
items were overlooked due to poor format design and uneven layout. 

Since any incomplete BI was useless as a research measuring instrument, 
it was necessary to mail a large number of these forms back to the 
teachers for completion. 

Tester observations frequently quoted teachers and aides as com- 
menting that most of the children were alike. Consequently, it was 
often noted that staff members tended to fill out BI's rather rapidly. 

Thus, despite the fact that testers were careful to give the BI to a staff 
member who knew and worked closely with the child, the accuracy of 
the responses may in some cases be questioned. 

Testers also noted that some teachers, and many of the aides, 
seemed confused by certain vocabulary items. An examination of the 
form reveals a rather sophisticated level of vocabulary, such as "im- 
perturbable" in item 23, "lethargic" in item 46, and "usurps" in item 16. 
Tester data do not attempt to include any record of the respondent's 
understanding of particular items, but in view of the wide range of 
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educational levels reported for staff members who completed the BI, it 
would be interesting to see if the vocabulary or general language style 
could significantly affect responses made. 

Further difficulties arose because of the complexity or ambigxiity of 
test items. In many cases, teachers complained that only c. part of the 
item was applicable to a child. For example, item 49 ("Approaches new 
tasks timidly and without assurance; shrinks from trying new things") 
mentions two characteristics; a child may be very timid, but nevertheless, 
he will still try something new. Item 32 ("Is reluctant to use imagination; 
tends not to enjoy 'make believe games',") presents the same rating prob- 
lem, for while the object is to measure a negative behavioral trait, many 
children who have little interest in make-believe games still possess and 
use their imaginations. Item 7 ("Often keeps aloof from others because 
he is uninterested, suspicious, or bashful") asks for a single rating on 
three separate attributes which are not synonymous. 

In other items, the ambiguity of terms makes it possible for respond- 
ents to interpret key words differently. For example, in item 47 ("Has a 
tendency to discontinue activities aft^r exerting a minimum of effort") it is 
difficult to know by what criteria teachers judged a child to have exerted a 
"minimum of effort." Item 12 ("Is rarely able to influence other children 
by his activities or interests") also has a sense of ambiguity. Here again, 
how does one determine or measure a child's influence on others? 

This absence of clearly defined criteria may also have affected the 
child's score where an item, which is rated as positive on the Behavior 
Inventory, was interpretated by the respondent as a negative characteristic. 
One example is item 8 ("Defends or praises his own efforts”). In some 
cases there is only a fine line of difference between the positive and nega- 
tive characteristic being measured. Compare, for example, items 5 and 19. 

The significance of these complexities and ambiguities has not been 
determined. However, while such factors may be insignificant in terms 
of overall validity, their appearance suggests a need for considerable fur- 
ther refinement of the inventory. 




These items read as follows: "Talks eagerly to adults about his own 
experiences and what he thinks" and "Is excessive in seeking the atten 
tion of adults . " 
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Spanish Teit Translations; PSI, PPVT, and DAP 

Spanish translations of the Pre-School Inventory (PSI), the Peabody 
Picture Vocabulary Test (PPVT), and the Draw-A-Person (DAP) test 
were prepared by two testers who were fluent in the Spanish language. 
Although the translations were primarily based on Spanish spoken in 
Southern Texas, (i.e,, "Mexican Spanish" as opposed to "Castilian 
Spanish") they were found to be fairly consistent with Spanish used in 
other areas such as the West Coast. If, however, a particular word was 
not commonly used in a given locale, the tester substituted a more famil- 
iar Spanish word, and noted the substitution on the child's test record. 

The testers began administering the first test (PSI) in English and trans- 
ferred to the Spanish language as soon as it appeared that the child was 
having difficulty in understanding the tester's English language. 

The general and detailed instructions for administering the PSI, 
PPVX| And DjAbP in Sp&nish follow* 

^ * Q^lLcral Instructions t o Tester s for Administerinjz the PSI. 

PPVT, and DAP in Spanish 

GENERAL NOTE: You should consult bilingual |ieople at 
your test center in order to pick up any regional differences in the 
Spanish, These translations were based on Spanish spoken in South 
Texas, and may vary from that spoken in New York, California, or 
Miami, If you find that a word used on the test is not one used in your 
region, use the latter, and indicate the change on the test, 

PSI - In the brief period before the testing begins, try to discern 
in which language the child is most comfortable. If the children are 
shy, this may be difficult to do. Also, since you will be looked upon as 
a teacher, and since English is usually spoken in the classroom, the 
child may feel he 1^ to speak in English, Begin the test in English. 
and carry the child as far as you can in that language. If he responds 
incorrectly, or appears reluctant or frightened to respond, ask the ques- 
tion a second time in Spanish and mark him according to his second re- 
sponse. After the first direction in Spanish, you should make every 
subsequent direction first in English, followed by the Spanish. Do not 
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pause to allow for a response between the two. Indicate this point by 
placing an S in the margin of the answer sheet. Responses ; If, while 
working exclusively in English, the child responds in Spanish, indicate 
this response by placing an S inside the box on the answer sheet. Once 
you have begun using English/ Spanish, indicate the language in which 
the verbal response was given by placing either an E or an S inside the 
box on the answer sheet. 

PPVT - Give Examples A, B, and C in English . If all three are 
recognized, begin with Item 1 and continue as far as you can in English. 
The first time a child misses a word, repeat the word in Spanish and 
Indicate this by placing an S in the margin. (If the child recognizes it 
in Spanish, give him credit for it. ) Now continue the test giving the 
English word first, followed by the Spanish word. Do not pause to allow 
time for a response between the two words. However, if you notice the 
child moving his hand in response to the English word, indicate this by 
placing an E in the margin. 

DAP • Direction should be: "Haz un retrato de una persona, el 
mejor que puedes." 

2. Specific Instructions 

a. PSI 

1. Cual es tu nombre? 

2. Cual es tu apellido? 

3. Cuantos anos tienes? 

4. CuiTndo es tu cumpleanos? 

5. Muestrame tu ojo. 

6. Muestrame tu* cuello. 

7. Muestrame tu hombro. 

8 Mu/strame tu talon. 

9. -12. (Que^ es esto? - C<^o se llama? - Quemifs?) 

9. Oreja 

10. Dedo 

11. Rodilla 

12. Codo 



o 

ERIC 



PRC R- 386 
D-11 



13. 

14. 

15. 

16. 

17. 

18. 
19. 
19. 

’ 20 . 

21 . 

22 . 

23. 

24. 

25. 

26. 
27. 

27. 

28. 
29. 

29. 

30. 

31. 

32. 

33. 

34. 

35. 

36. 

37. 

38. 

39. 



Muy bien. Qoiiero que hagas unaa cosaa para mi. 

Levanta la mano. 

Muevete. 

voz alta. 
voz auave. 

Ahora, levantate y haga frente a la puerta. 

Ahora, brinque. 

- 26. - Muy bien. Ahora, aientate en la ailla. 

Pon el carro rojo encima de la caja negra. 

Pon el carro azul debajo de la caja verde. 

Pon el carro amarillo encima de la caja chiquita. 

Pon un carro en la caja mediana. 

f l°otr:^Vdo°d. - 

Pon trea carros en la caja grande. 

Pon dos carros detraa de la caja en el medio. 

Dame todo a miu 

- 28. Imagfnate que este es un tren. Sabes lo que es un tren? Tu «a 
tiene muchos carros. uno traTs de otro. asf. ® 

Sabes lo que llamamos al primer carro. el que estira el tren? 

Como se llama el ultimo carro de un tren? 

- 33. - Sabes lo que es un columpio? Tu sabes como se mueve un 

columpio -- arriba y abajo y por el atras y por el frente. 

Dueno. como se mueve un serrucho? 

Como se mueve un ascensor? 

Como se mueve una montana de carnaval? 

Como se mueve un disco? 

Como se mueve una cascada? 

Cuando desayunamos? 

En cual tiempo del ano hace mas calor? 

En cual tiempo del ano hace mas frio? 

En cual tiempo del ano estamos ahora? 

Si quieres encontrar un le on, ddnde lo buscarias? 

Si quieres comprar gasolina, a donde ir{as? 
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40. Si estarias enfermo(a) a quien verias? 

41. Si quieres encontrar barco, donde lo buscarfas? 

42. Si quieres leer algo, que harias? 

43. Que hace un dentista? 

44. Que hace un policia? 

45. Que hace un maestro? 

46. Que hace un pap^? 

47. Que hace una mama? 

4§. - 56. - Cuantos hay? De todo, cu^tos hay? 

48. CuaTntos ojos tienes? 

49. Cua^ntos narizes tienes? 

50. Cuauntos manos tienes? 

51. Cuantos dedos en los pies tienes? 

52. Cuantas ruedas tiene un carro? 

53. Cuantas ruedas tiene una bicicleta? 

54. Cuamtas ruedas tiene un bicicleta chiquita? 

55. Cuantas ruedas tiene una carretilla? 

56. Cuantas ruedas tiene un barco de remo? 

57. Cuenta en voz alta. 

58. Cuantas esquinas tiene este papel? 

59. Cual tiene mas damas? 

60. Cual tiene mas damas? 

61. Cual tiene menos damas? 

62. - 66. Pon estas damas en una linea. 

62. Dame el del medio. 

63. Dame el primero. 

64. Dame el ultimo. 

65. Dame el segundo, 

66. Dame el penultimo. 

67. - 70. - Ahora, quiero que haras unos dibujos. Haz uno asi^. 

71. - 73. - Cual se parece mas un(a)^ 

Cual de los dibujos es eso? 

71. Rueda. 

72. Carpa. 
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73. 


Palo. 




74. 


Cual es mas mayor, ima pelota 


o una bicicleta? 


75. 


Cual es mas mayor, un arbol o 


una flor? 


76. 


/ ^ 

Cual es mas despacio, un carro 


o una bicicleta? 


77. 


Cual pesa mas, un ladrillo o un 


zapato? 


78 . 


Cual pesa mas, una pluma o un 


tenedor? 


79. 


De que color es? 




80. 


De que color es? 




81. 


Cual de estos es el color del cielo? 


82. 


Cual es el color de la noche? 




83. 


Pinta el circiilo amarillo. 




84. 


Pinta el cuadro morado. 




85. 


Pinta el triangulo naranja. 






b. PPVT 




1. 


table - mesa 




2. 


bus - bus 




3. 


horse - caballo 




4. 


dog - perro 




5. 


shoe - zapato 




6. 


finger - dedo 




7. 


boat - barco 




8. 


children - nmos 




9. 


bell - campana 




10. 


turtle - tortuga 




11. 


climbing - subiendo 




12. 


lamp - lampara 




13. 


sitting - sentado 




14. 


jacket - saco 




15. 


pulling - estirando 




16. 


ring - anillo 




17. 


nail - clavo 




18. 


hitting - pegando 




19. 


tire - llanta 
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20, ladder - escalera 

21, snake - vibora 

22, river - rio 

23, ringing - sonando 

24, baking - cocinando 

25, cone - cono 

26, engineer - maquinista 

27, peeking - asomando 

28, kite - giiila 

29, rat - raton 

30, time - hora 

31 , sail - vela 

32, ambulance - ambulancia 

33, trunk - baui 

34, skiing - esquiando 

35, hook - anzuelo 

36, tweezers - pinzas 

37, wasp - avispa 

38, barber - barbero 

39, parachute - paracaidas 

40, saddle - silla dq^aballo 

41, temperature - temperatura 

42, captain - capit^ 

43, whale - ballena 

44, cash - dinero 

45, balancing - balanciando 

46, cobweb - telarana 

47, pledging - jurando 

48, argument - pleito 

49, hydrant - mono de agua 

50, binocular - largavistas 

51, locomotive - locomotora 

52, hive - colmena 
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53. 

54. 



55. 

56. 



57. 

58. 

59. 

60 . 
61. 
62. 

63. 

64. 

65. 

66 . 

67. 

68 . 

69. 

70. 

71. 

72. 

73. 

74. 

75. 

76. 

77. 

78. 

79. 

80. 
81. 
82. 

83. 

84. 

85. 

86 . 



reel - carrete 

insect - insecto 

gnawing - royendo 

weapon - arma 

bannister - baranda 

idol - i"dolo 

globe - globo 

walrus - morsa 

filing - archivando 

shears - tijeras 

horror - horror 

chef - cocinero 

harvesting - cosechando 

construction - construcci^ 

observatory - observatorio 

assistance - asistencia 

erecting - haciendo 

thoroughbred - de sangre pura 

casserole - casuela 

ornament - adorno 

cobbler - zapatero 

autumn - otono 

dissatisfaction - descontento 

scholar - escolar 

oasis - oasis 

soldering - soldando 

astonishment - sorpresa 

tread - 

thatched - techado con paja 
jurisprudence - jurisprudencia 
sapling - renuevo 
arch - arco 
dwelling - residencia 
lubricating - aceitando 
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87. 

88 . 

89. 

90. 

91. 

92. 

93. 

94. 

95. 

96. 

97. 

98. 

99. 
100 . 
101 . 
102 . 

103. 

104. 

105. 

106 . 

107. 

108. 
109. 
no. 
111 . 
112 . 

113. 

114. 

115. 

116. 

117. 

118. 

119 . 

120 . 



pedestrian - peaton 
vale - Canada 
jubilant - jubiloso 
laden - cargado 
pursuit - persiguiendo 
goblet - caliz 
rodent - roedor 
confiding - contando 
reclining - acostado 
frisking - esculcando 
moat - mote 
salutation - salutacion 
barrier - barrera 
foal - potro 

incandescent - incandescente 

cornucopia - cornucopia 

ascending - ascendiendo 

summit - apice 

caster - rodadillo 

lobe - lobulo 

patriarch - patriarca 

sampler - bordado 

ingenious - ingenioso 

repose - reposo 

constrain - detener 

tangent - tangente 

sconce - candelabra de pared 

hoary - canoso 

pendant - pendiente 

prodigy - prodigio 

casement - 

quiescent - tranquilo 

talon - garra 

chevron - galcm 
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c. DAP 

Directions should be: "Haz un retrato de una persona, 
el mejor que puedes. " 

Time Required to Administer PPVT and PSI 

Exhibit D-1 provides information in bar graph form on the mean 
number of minutes spent by testers in administering the PPVT and the 
PSI to 3-, 4-, 5-, and 6-year-old children. The exhibit shows that, 
as might be expected, the older the child, the more time was required 
to administer the PPVT. (The PPVT is open-ended, and older children 
usually cover more test items. ) For the PSI, however, 4- and 6-year- 
olds on the average required about the same amount of time (23.8 and 
23.5 minutes), 5-year-olds required the least amount (22.6 minutes), 
and 3-year-old3 needed the most time (25.1 minutes). (On the PSI, the 
number of items remains constant, and older children tend to go through 
them a bit more quickly than children as young as 3. ) 

The mean administration time for the PPVT is given as 10 to 15 
minutes in the PPVT test manual. ^ Although PRC found that the mean 
times ranged from 8.5 to 11.0 minutes (with standard deviations ranging 
from 3.3 to 4,4 minutes), the PRC study dealt only with relatively young 
children, while the PPVT manual deals with children up to age 18. 

According to the PSI manual, the inventory "takes no more than 
15 minutes to administer to most children."^ This figure differs some- 
what from the required time as reported by PRC testers, since mean 
times (for different age groups) ranged from 22.6 to 25.1 minutes (with 
standard deviations ranging from 6.6 to 8.5 minutes). 

In terms of planning time allowances for future testing of Head 
Start children, the time differences between the various ages from 3- 
to 6-year-olds would probably be insignificant. Realistically, about 
10 minutes should be allowed per child for the PPVT, and about 25 
minutes per child for the PSI. 



Dunn, Lloyd, Expanded Manual, P eabody Picture Vocabulary Test, 
Minneapolis, Minnesota: American Guidance Service, Inc., 1965, p. 5. 

2 

Caldwell, Bettye and Donald Soule. The Pre-School Inventory (Manual), 

1966, p, 3. 
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Note: (1) N’s (sample sizes) for each age differ slightly between PPVT 
and PSI because test administration time was sometimes not 
recorded by testers. 

EXHIBIT D-1 MEAN TIME IN MINUTES REQUIRED TO ADMINISTER 

THE PPVT AND PSI TO THREE-. FOUR-, FIVE-, 

AND SIX-YEAR-OLD CHILDREnCI ) 
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Copiet of Family Information Form and Behavior Inventory 

The Englith and Spanieh vereione of the Family Information Form 
(CAP-HS Form 46) and the Behavior Inventory are presented in Ex- 
hibiti D-2, D-3, and D-4, reipectively. 
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I 

I 



EXHIBIT D-2 ENGLISH VERSION OF FAMILY INFORMATION 

FORM (CAP-HS FORM 46) 



|l 



f 

er|c 






■iiliHHMillrillMH 
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OFFICE OF ECONOMIC 




.IXECUTIVE OFFICE OF THE RESIDENT 
WASNINOTON, OX. 20Sit 



BUD«tT lURKAU NO. 
ANNMOVAL IXNINIt JUNK M, tt«7 



Dokt Parent or Guarw«an: 

Tilt Head Start School in which yonr child ia enrolled ia partially Hnanced hy the Office of 
Economic Opportunity in Waahinaton, D.C. In order to explain the program to the Conareaa and 
the general public, the Office of economic Opportunity needa aome facta about Head Start 
famiTiea. 

Your anawera to the <]ueationa on thia form will be combined with thouaanda of othera from all 
over the United Statea. Thia ahould give ua information about the familiea which are enrolling 
children in the Head Start program. 



If you have trouble anawering any omeationa on the form, pleaae aak for help with it at the Head 
Start Child Develcmment Center. When you finiah, pleaae mail the form ri|^t back in the encloaed 
poatage-paid envelope. 

Sincerely youra, 






Jule M. Suaapian 
A.aaociate Director 
Project Head Start 



If the return envelope ia 
miaaing, mail thia form to: 

Pro{eet Head Start (Summer) 
Census Operotions Office 
1201 East Tenth Street 
Jeffersonville, Indiana 47130 



n. 



CAP’HS FORM 4S (S-20-Sfl) • 
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Pagt 2 



Th* htUwing qvBtlonB or« Bbout o/l thB pBoph wlfh whom fkw child In the Hnod Start tefcool vtvWfy 
Afitwtr tfc«m hy marking an **X" In rarrect kax ar antaring a nmnkar In tha apaca pravldad, 


LIST OF PEOPLE 


NUMBER AND AGES OF PEOPLE 


1. Is thsrs Istlisf* 

•r fsstsr fatlisr lMii| witli mm 
chlUf 


1 □ Yes ^ 

*□ No 


le. Hew eld Is ko? 

1 n s n years 

a □ 21-54 years 4 □ 45 or older 


2. It thsrs • msthtr, tfsp-motiitr, 
•r lotisr in«tk«r living with ihn 

ckiyr 


1 □ Yes ^ 

an No 


2e. How old is sko? 

1 □ Less than 21 years old s □ 55-44 years 
a [ 121 -54 years 4 □ 45 or older 


IF THERE IS NO FATHER OR 
MOTHER AT HOME: 

3. Wko It tk« ckili's gunrdlnn (tk« 
gsrsM ckMIy r*sg«Aslkl« fW 
tk« «kiy t cma)f 


1 □ A brother, uncle, 
grandfather, or 
other mate 
relative— — 0 - 

s O A sister, aunt, 
grandmother, or 
other female 
relative ■..»» 

s □ A man - not 

related o- 

4 □ A woman - not 
related - »- 


3o. How old is the guardian? 

1 n Less then 2t years old 
a □21- 54 years 
!□ 55 - 64 years 
4 □ 65 or older 


4. Ar« tksrs «ny krsfksrs or slstors 
living of komof (Do not count a 
brother or eieter who i* acting 
as the child* M guardian*) 


iQYes ► 


4o. How mony of Hiom oro: 


Number 


an No 


Undor 6 years old? 

6-15 years? 

16-21 years? 

22 years or older? 




5. Aro tkoro any otkor ralotivos (suck 
at grondlparontt, cousins, and 
to on) wno livo in tko tamo houto 
at tko ekild? (Do not count a 
relative who it acting as child** 
guardian.) 


1 □ Yes 

a □ No 


5a. How man/ of them are: 

Undor 6 yoors old? 

6- 15 years? 

16-21 years? 

22-54 years? 

55-64 years? 

65 years or older? 


Number 


6. Aro tkoro any otkor poopio wko 
livo in tko tamo komo but aro 
NOT rolatod to tko ckiid (nor 
acting as tko ckild’t guardian)? 


1 □ Yes 

2QN0 


6a. How many poopio livo in the same house 
and oro NOT related to the child? 

Numbftr 


If the child 1* living with parent*, the following questions i 
If the child Is NOT living with parents, the following quest 

7. Ploato mark an **X'* in tko box wkick shows tko highost 

Father or male guardian 

1 CZIl 1^0 school 5 Q] 9 -to 1 1 years 

2 □ 1 to 3 years 6 □ High school graduate 

3 □ 4 to 6 years 7 □ Some college or 

college graduate 

4 □ 7 to 8 years 


oro about them. 

Ions are about his guardian. 

grade of school the parents or guardian completed. 
Mother or female guardian 

1 □ No school 5 □ 9 to II years 

2 □ I to 3 years 6 □ High school graduate 

3 □ 4 to 6 years . 7 □ Some college or 

, , _ ^ - college graduate 

4 □ 7 to 8 years 



CAP.H» FORM 4« (c>20>6Cl P/*ose Continue on next page 
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t. 



WiMrt U tk« usmI •€ctifcti«ii •! Hm cktN's pcraiit tr fMrdttaiit (For §xompl§, earp§iu§r§, lakororg, oui,) 
Ftthtr or milt guardian Mothar or fdmata guardian 



f. Aro tkoy naw tmpUytdf 
Fathtr or mait guardian 

t □ Now empioyed 
2 Q] Now unempioyed 

TO. How mony montks did tkoy work daring tko post 

Father or maie guardian 

1 □ 12 months 

2 □ 10 ^ J months 

s □ 7 to 9 months 

4 □ 2 to 6 months 
s □ i month or iess 
s □ Did not work 



Mothar or femaia guardian 

1 □ Now tmpioyad 

2 Staying home (keeping house only) 

yoorT (July 1, 2965 to June 30, 1966) 

Mother or female guardian 

1 □ 12 months 

2 □ 10 or i I months 

• D 7 to 9 months 

* CZ] 2 to 6 months 

s □ I month or less 
e □ Did not work 



Tko following gutsf/ons oro ohout fho koine iho child Im flaying In now. 

Tl. Floose mark an “X" in tko box which tolls wkoro tko houso is located. 

1 □ On a farm 

2 □ in the country, but not on a farm 
• IZH • city or town 

4 □ in the suburb or on the outskirts of a city or town 

12. How many rooms in the house or apartment are regularly used for sleeping? 

Number of rooms 

13. Is there running water inside the house? 

Id] Yes 

2 d]No 



The following qufsflons are about fho child who is now In fho Hoad Start school and about any othor chlldron 
you llstod in quostlons 4 or 5 . 

14. Has the child who is now In the Head Start school been in a Head Start, kindergarten, or nursery class before? 

1 d] Yes — In a Head Start class 

2 dH Yes — In kindergarten or nursery class 
sdl No 



15. Have any of the other children been In a Head Start, kindergarten, or nursery class? 

1 d] Tes — in a Head Start class 

2 dH Yes - In kindergarten or nursery class 

3 d] No 



P/eose continue on next page 
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4 

Mi. Wh#n y«ur chiM In tht HnadI Start ichoal wa§ axamlnad by tha daetor «r dontlst was anything found to bo 
th# mottor with him or horf - - 

' Q Yii f Q Not txamlnod 

2 □ No 4 Q Don’t know 

b. If somothlnf was found wrong, did tho child got troatmont lor It? 

1 □ Yet 
^□No 

' s □ Don’t know 
e. Whoro was troatmont givon? 

1 □ Doctor’s or dentist’s office s Q Hospital 

2 □Clinic 4 0 Other rSpeet/y; 

The following quo$tIon» aro about tho family with whom tho child lo living. Thoy do not apply to any poobIo votr 
llotod In quoMtlon 6. rr i , r r r 

17. Does tho family rocolvo any public wolfaro? 

1 □ Yes 

2 □ No 

18. In which of tho following incomo groups would tho child’s family bo? 

(Count income of all family members. Include earningS) welfare, assistance, and all other hinds of income.) 

1 □ Less than $ 1 ,000 a year 6 □ $5,000 to $5,999 a year 

2 □ $ 1 ,000 to $ 1 ,999 a year 7 □ $6,000 to $7,999 a year 

s □ $2,000 to $2,999 a year 8 □ $8,000 to $9,999 a year 

4 I — I $3,000 to $3,999 a year g | | $10,000 or more a year 

5 □ $4,000 to $4,999 a year 

19. Please mark an “X” in the box which shows whether the family has the use of: 

a. A car or truck i □ Yes 2 □ No 

b. A radio i □ Yes 2 I I No 

c. A television set i □ Yes 2 □ No 

d. A telephone 1 □ Yes 2 I I No 

20. Is there anyone in the family who usually gets a newspaper? 

I Yes 2 □ No 

If “Yes” to question 20 — How often? 

1 □ Every day 

2 □ At least once a week 

3 □ Less than once a week 

Than^k you very much for your cooperation. Please use this space If you would like to say something about 
the Head Start school program that may aid the Office of Economic Opportunity in providing better programs 
In the future. For example: the need for transportation, different hours, longer programs, etc. 



CAP’HS FORM 46 (6*20-66) 
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EXHIBIT D-3 SPANISH VERSION OF FAMILY INFORMATION 

FORM (CAP-HS FORM 46a) 
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OFFICE OF ECONOMIC 




iXECUTIVE OFFICE OF THE PRESIDENT 
WASHINGTON, D.C. 20506 

■ UOSKT BURKAU NO. lle.ROSa 
ARRROVAL BXRIRBS juNB » 0 . 19*7 



£stimsdo Padre o encargado: 

ss-.i's: 

otr«*qrv«drf?'d%7od« ^ •"*" comblaadaa coa mile, d. 

acerca de las familias oue eatSn ma^r*r 1 Un^os. Estaa respuestas nos dar^n informaci^n 

ucrua ae las lamiiias que est&n matnculando sus nilTos en el Programa “Head Start.” 

DeHamllo'd” N^o ‘Tead sf« >“ preguntaa, favor de pedir ayuda al Centro de 

sarrolio del INiBo Head Start. Cuando termine de contestar esta forma, favor de enviarla 

mmed.atamente por correo u.ando el aobre que le inclufmoa, el cual no neceaita .ello 

Atentamente, 







Jule M. Sugarman 
Director Asociado 
Project Head Start 



Si no se le incluyd un sobre, 
favor de enviar esta forma 
por correo a: 

ProjBct Head Start (Summer) 
Census Operations Office 
1201 East Tenth Street 
Jeffersonville, Indiana 47130 



CAP-HS FORM 4«a 



Pagina 2 
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**u* *on octree c/t /05 personas qv rtgularnfnf^ vivn con t/ ni»To matriculado tn /o tscut/o 

neod itoft. Confttft tsfos pr9gunta$ morconc/o una "X** tn t/ tneot/Z/oefo corrteto, o onott un numero tn tZ 
tspoc/o provisto. 



LISTA DE PERSONAS 



NUMERO Y EDADES DE PERSONAS 



1 . 1 Hoy un podrt, podroitro o podrt 
dt crionzo vivitndo con tl ninb? 



i[:2sf- 
2 n No 



1o. dQut tdod titnt? 

1 Menor de 21 an os 

2 21-54 anos 



3 55-64 anos 

4 65 aftos 0 mSs 



2. dHoy uno modrt, modrostro o 
modrt dt crionzo vivitndo con 
ti nine? 



1 cz s{. 

2CJ No 



2o. dQut tdod titnt? 

1 □ Menor de 21 anos 

2 ZZ 21-54 afios 



3 ZJ 55-64 aRos 

4 ZZ 65 anos o mis 



SI NO HAY PADRE 0 MADRE EN 
EL HOGAR: 

3. 4Quitn tf tl tncorgodo dtl nino 
(lo ptrsono moyormtntt rtsponsobit 
dti cuidodo dtl niffo)? 



1 ZZ Un hermano, tfo, 

abuelo u otro 
familiar vardn — i 

2 ZZ Una hermana, tfa, 

abuela u otro 
familiar 

mujer 



3 ZZ Un hombre no 

relacionado — 

4 ZZ Una mujer no 

relacionada — 



3o. dQut tdod titnt lo ptrsono tncorgodo? 

1 ZZ Menor de 21 anos 

2 ZZ 21-54 anos 

3 ZZ 55 - 64 aRos 

4 ZZ 65 anos o mis 



4. dTitnt tl nino htrmonos o 
htrmonos vivitndo tn tl hogor? 

(No incluya el hermano o hermana 
que actifa como persona encargada.) 



1 ZZ s'- 

2 ZZ No 



4o. dCuintos hoy? 

Mtnorts dt 6 onos , 
6-15 onos ...... 

16-21 onos 

22 Olios o mis .... 



Nifmero 



5. dTitnt tl nino otros poritntts 
(toits como obutlos, primos, ttc.) 
qut vivtn tn tl hogor con 61? 

(No incluya el pariente que aetda 
como encargado del nino.) 



I sr- 



2CDN0 



5o. dCuintos hoy? 
Mtnorts dt 6 onos 

6-15 onos 

16-21 onos 

22-54 onos 

55 - 64 olios 

65 oRos o m is . . . 



Ndmero 



6< t Exists olguno otro ptrsono que 
vive tn lo mismo coso ptro que no 
ts poriente del nino? (No incluya 
la persona que actua como 
encargada del nino.) 



1 ZZ sr- 



2ZZNo 



6o. tCuintos personos vivtn tn lo mismo coso con 
el nino que no son fomiliores del nir^? 



Numero 



Si e/ ni'nq vive con sus padres, las siguienfes preguntas son ocerco de ellos. 

Si el nifSo no vive con sus padres, las siguienfes preguntas son ocerco de su encargado. 

7. Fovor de morcor con uno **X" en el encosillodo que indico el grodo de escuelo mos alto que los podres o 
encorgodo cursoron. 



Padre o engargado: 

1 ZZ No asistio’ a la 

escuela 

2 ZZ I 3 3 anos 

3 ZZ 6 a j6 anos 

4 ZZ 7 a 8 anos 



5 ZZ 9 a 1 1 aRos 

6 ZZ Graduado de 

Escuela Superior 

7 ZZ Asistii a la universidad 

0 graduado de 
universidad 



Madre o encargada: 

1 ZZ No asistii a la 

escuela 

2 ZZ I 3 3 anos 

3 ZZ 4 a 6 anos 

4 ZZ 7 a 8 afios 



5 ZZ 9 a 1 1 aRos 

6 ZZ Graduado de 

Escuela Superiod 

7 ZZ Asistii a la universidad 

0 graduado de 
universidad 



CAP’HS FORM 4e« (e- 2 i*ee) 



Favor dm continuar en la pagina siguiente 
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Pagiaa 3 



8. ^Cu6l «s la ocupacion acostumbrada d*l padre o encargado del ninb? (Por ejemplo, carpintero, abrero, etc.) 
Padre o encargado Madre o encargada 



9. iEstan empleados actualmente? 

Padre o encargado: 

1 □ Empleado actualmente 

2 □ Desempleado actualmente 

10. iCudntos meses trabajaron durante el ono pasado? 
Padre o encargado: 

1 □ 12 meses 4 Qj 2 0 6 meses 

2 10 0 1 1 meses 5 I mes 0 menos 

3 □ 7 0 9 meses 6 □ No trabajo" 



Madre 0 encargada: 

1 □ Empleada actualmente 

2 I " I En la casa (Cuidano del hogar solamente) 

(Julio 1, 1965 a Junto 30, 1966) 

Madre 0 encargada: 

1 □ 12 meses 4 2 0 6 meses 

2 □ 10 0 1 1 meses 5 □ I mes 0 menos 

3 □ 7 o 9 meses e □ No trabajrf 



Las s/gu/enfes preguntas son acorca del hogar en quo el nino v/ve actualmente. 

11. Favor ds marcar con una "X" *n *1 •ncasillado que IndTca donde esta localizada la casa 

1 □ En una finca, granja 0 rancho s □ En la ciudad 0 pueblo 

2 □ el campo pero no en una finca, granja 0 rancho 4 □ Suburbio, 0 afuera de la ciudad 0 pueblo 

12. dCumtos cuartos en la casa o apartamento se usan regularmente para dormir? 

NiJmero de cuartos 



13. cHay agua corriente en la casa? 

1 Cl] Si 2 n No 



Las siguientes preguntas son acerca del ninb que estd en la escuela "'Head Start" y todos /os otros ninos 
mencionados en las preguntas 4 y 5. 



14. iAsistid el ninb que esta actualmente en la escuela "Head Start" anteriormente a otra close "Head Start," 
a una escuela de nines (Kindergarten), o a una escuela maternal (Nursery)? 



1 □ Sr — A clase de "Head Start" 

2 □ Sf - A escuela de ninos (Kindergarten) 0 escuela maternal (Nursery) 
arn No 



15. 4Ho asistido alg(m otro de los nines en su case a escueio "Head Start", escuela de nines (Kindergarten), 

0 escuela maternal (Nursery)? 

1 Sf - A clase de "Head Start" 

2 Q sf — A escuela de ninos (Kindergarten) 0 escuela maternal (Nursery) 

3 □ No 



Favor do confinuor on la pdgina siguionfo 



Papina 4 
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16a. Cuondo su nine fu* examinodo por el doctor c dentisto en lo Escuelo Heodstort, ise le recomendo o usted que el 
nino recibiero olgun trotom lento? 



1 [-Si 

2 Cj No 



3 [[[] No se examind 

4 □ No se 



b. Si olgun tratomiento fue recomendodo, crecibio el niH^o este trotomlento? 

1 □sr 

2 □ No 

3 [[2 No se 

c. Si recibid el tratomiento, c En donde fue este tratomiento dodo? 



1 [[2 Oficina del Doctor o Dentista 

2 Qj CImica Privada 



3 □ Hospital 

4 □ Otro sitio 



Los sjguientes pregvntas son acerca de la familia con quien al niffo v/ve. No se aplican a ninguna e/e las personas 
inclufdas en la pregunta 6. 

17. cRecibe la ramilio oyudo de Bienestor Publico? 

1 [Z]sr 

2 □ No 

18. iA cual de ios siguientes grupos de entrodo onuol pertenece la familia del nine? (Considere el sueldo de 
todos los miembros de la familia que . iven en el hogar. Incluya dinero recivido en sueldo, ayuda publica, 
asistencia, etc.) 



1 □ Menos de $1,000 al a?(o 

2 [[[] $ 1 ,000 a $ 1 ,999 anual 

3 rn $2,000 a $2,999 anual 

4 □ $3,000 a $3,999 anual 

5 [[[] $4,000 a $4,999 anual 



6 [[[] $5,000 a $5,999 anual 

7 [[[] $6,000 a $7,999 anual 

8 [[[] $8,000 a $9,999 anual 

9 C[] $10,000 0 mas al arfo 



Fovor de morcor con uno 


”X” en el encosillodo 


que indico si 1 


a. Automovil o camion . . 


i[Z]sf 


2 1 1 No 


b. Radio 


i[Z]Sf 


2 1 1 No 


c. Television 


lasf 


2 Q No 


d. Telefono 


1 □ sr 


2 1 1 No 



20. &Hoy olguna persona en lo familia que adquiere el periodico regulormente? 
I Sf 2 □ No 

Si la contestacidn es “Sf” a la pregunta 20 - dCudn o menudo? 

1 [[[] Todos los dfas 

2 [[[] Per lo menos una vez a la semana 

3 □ Menos de una vez a la semana 



Muchas grocios por su cooperocidn. Por favor use este espocio si quiere decir algo ocerco del progromo de 
escuelos **Heod Start” que puedo oyudar a lo Oficina de Oportunidodes Economicos a proveer mejores progromos 
en el future. Por ejempio: lo necesidod de tronsportocidn, horos diferentes, progromos mos iorgos, etc. 



CAP'HS FORM 40a (e*2i>ee) 



ERIC 



mm 
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EXHIBIT D-4 BEHAVIOR INVENTORY (CAP-HS FORM 37) 



o 

ERIC 

hiaifiiifftffffi-aaa 
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CAP-HS Para 37 



OPPICI OP ECONOMIC OPPORTUNITY 
PROJECT HEAD START 

BEHAVIOR INVENTORY 



Child's nams 



School 



Grant No. 



Cantor No. 



Class No. 



Child No. 



Examlnof*s I dantlfl cation 



Data 



I 



1 



I 



Prasant waak of 
cantor's oparation 



INSTRUCTIONS 

Please indicate as accurately as possible how this child behaves by markinx one of the four 
responses to each question. Base /our response to every Item on your personal observation 



1. is usually carefree; rarely becomes frixhtened or apprehensive 


Very 

mu^ 

like 


Some- 

what 

like 


Very 

little 

like 


Not 
at ell 
like 




2 


3 


4 


2. Is sympathetic, considerate, and thouxhtful toward others 










3 . is easily distracted by thinxs xo>nx on around him 










4. Is very suxxostibie; lets other children boss him around 










5 . Talks eaxoriy to adults about his own experiences and what he thinks 










6 . is unduly upset or discouraxed if he makes a mistake or does not perform well . . . 










7. Often keeps aloof from others because he is uninterested, suspicious, or bashful . . 

8 . Defends or praises his own efforts 


















9 . is confident that he can do what is expected of him 










10. is jealous; quick to notice and react nexatively to kindness and attention 

iinAn rhtlcirAn 










11. Is methodical and careful in the tasks that he undertakes 










12. is rarely able to influence other children by his activities or interests 










13. Tries to fixure out thinxs for himself before askinx adults or other children 

for hoio 










14. Greatly prefers the habitual and familiar to the novel and the unfamiliar 










IS AnnA^rc fo trust in his own sbilttios 












16. Has little respect for the rixhts of other children; refuses to wait his turn, 

usurDS toys other children ere oleyini withi etc 










17. Seems disinterested in die x*neral quality of his performance 










18. Responds to frustration or disappointment by becominx exx^’^^^ive or enraxed . . . 

19. Is excessive in seekinx the attention of adults 

90 Stioks witb p ioh until it is finished 




























21. Goes about his activities with a minimum of assistance from others 

22. is constricted, inhibited, or timid; needs to be urxed before enxaxinx in activities 

23. Is even-tempered, imperturbable; is rarely annoyed or cross 


























24. Is reluctant to talk to adults; responds verbally only when urxod . . 

25. Works earnestly at his classwork or play; does not take it lixhtly r ' ' ' 


















26. Is often quarrelsome with classmates for minor reasons 











Plaasa oantiniM an ravarsa sl4a 
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27. Does not need attention or approval from adults to sustain him in his work or play 

28. When faced with a difficult task, he either does not attempt it or fives up 

very quickly 


Very 

much 

like 


Some- 

what 

like 


Very 

little 

like 


Not 
at ail 
like 


Ll 


2 


r 


4 










29. Does not like to be interrupted when enfafed in demandinf activities, 

e.f.. puzzles, paintinf, constructinf things 










30. Welcomes changes and new situations; is venturesome, explores, and 

jenerally enjoys novelty 










31. Calmly settles difficulties that arise without appeal to adults or others 










32. Is reluctant to use imafination; tends not to enjoy * 'make-believe" fames 










33. Likes to talk with or socialize wi^ the teacher 










34. Often will not enfafe in activities unless strorTfly encourafed 

35. is eafer to inform other children of the experiences he has had 




• 














36. Emotional response is customarily very stronf; over-responds to usual 

classroom problems, frustrations, and difficulties 

37. Is uncooperative in froup activities 






• 












38. Is usually polite to adults; says "Please,” "Thank you," etc 










39. Asks many questions for information about thinfs, persons, etc. 

(Emphasis here should be on questions prompted by genuine 

curiosity rather than bids for attention.) 










40. Usually does what adults ask him to do 

41. Requires the company of other children; finds it difficult to work or play by himself 


















42. Responds to frustration or disappointment by becominf sullen, withdrawn, or sulky 

43. Demonstrates imafinativeness and creativity in his use of toys and play materials 

44. Insists on maintaininf his rifhts, e.f.. will not yield his place at paintinf, or 
at the carpentry bench, etc.; insists on fettinf his turn on the slide or in 

froup fames, etc 




























45. 1s wanted as a playmate by other children 

46. Is letharfic or apathetic; has little enerfy or drive 

47. Has a tendency to discontinue activities after exertinf a minimum of effort 


























48. is lenerallv a haooy child 










49. Approaches new tasks timidly and without assurance; shrinks from tryinf new 
thinfs 










50. What he does is often imitated by other children 











DO NOT MARK IN THIS SPACi 



CAP-HS POMM »7 



Pat* 2 



QPO tO««7U 



APPENDIX E 



EXAMINATIONS OF SELECTED CENTERS 



A. Introduction 

Although the major thrust of the Head Start study was not directed 
at comparative evaluation of specific staff, facilities, or center pro- 
grams, an effort has been made to identify, ex post facto, some of the 
most and least effective Child Development Centers. 

What are the elements which might mark a particular center as 
more or less effective? Obviously, there were great variations from 
center to center in terms of staff, physical plant, programing, and chil- 
dren served, not to mention intercenter ''philosophical" differences. 

Staff varied with respect to such factors as race, age, level of 
education, and amount and type of experience with young children. The 
staff of the Child Development Centers, then, could not be labeled as 
homogeneous. 

The centers varied in size, layout of rooms, furnishings, equip- 
ment, lighting, ventilation, outdoor space available, etc. Head Start 
classes were held in facilities ranging from damp, dim church base- 
ments to model air-conditioned classrooms with wall-to-wall carpeting. 

In some centers there was considerable emphasis placed upon 
cognitive stimuation and language development, while in other centers 
staff efforts seemed to be aimed more at meeting certain social and 
emotional needs of the children. While general guidelines for programs 
were provided by Head Start headquarters, daily schedules for individ- 
ual centers varied greatly regarding routines, amount and type of struc- 
tured activity, free play, etc. Where there was more than one class in 
a center, intracenter differences were often marked. Standardization 
of programing was certainly not a characteristic of Head Start classes. 

Finally, the children varied in ethnic and socio-economic back- 
ground, age, achievement, attendance, and other measures. A 3-year- 
old, Spanish- speaking girl of PPVT IQ 70 , from a "broken home" in a 
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Los Angeles slum is indeed different from a 6-year-old, English- 
speaking boy of PPVT IQ 100 from an intact home in a small midwestern 
community. Yet, both could have been Head Starters. 

In summary, one cannot help but be impressed by the sheer di- 
versity of the personnel, plants, programs, and enrollees in the total 
Head Start operation. 

Efforts have been made before to find the factors that distinguish 
"successful" or good CDC's from poor ones. ^ The problem, method- 
ologically as well as conceptually, is complex (see Section V). In this 
appendix, three approaches to the examination of center or direct pro- 
gram variables are presented. The first consists of identifying high 
and low ranking centers on the basis of raw score indices and exam- 
ining associated information. One form of this approach is to use an 
age-corrected index, such as the center average PPVT IQ. A second 
index is to use the center residuals- -that is, the deviations of the cen- 
ter means from their fitted values after regression. Again, the highs 
and lows can be singled out and examined. A third approach, admit- 
tedly weak, given the design of the present study, is to compare classes 
within centers in an attempt to identify outstanding classes. 

The three methodli are presented here more as demonstrations 
than as rigorous and planned primary investigations. Again it should 
be recalled that this study did not utilize pre-testing; it deals with 
comparisons of end test standings, not with gain scores. 

B. Raw Score (RS) and IQ Indices 

In order to obtain at least a crude ranking of individual centers 
as a fiinction of children' s performance, an unweighted average of raw 
score means for each center was obtained by: 

PPVT RS ^ BI RS ^ PSI RS ^ VSMS RS 
center mean center mean center mean center mean 



See, e.g.r Ozer, Mark. The Effects of Neurological and Environ- 
mental Factors on the Language Development of Head Start Children: 
An Evaluation of the Head Start Program. Report of OEO Contract 
5^8, 1965. 
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These sums of raw score means were ranked from highest to 
lowest. The top ten centers (those with the ten highest sums of raw 
score means) were compared with the lowest ten centers. A comparison 
of the former (hereafter referred to as the TT centers) with the latter 
(BT centers), reveals a number of interesting trends (see Exhibit E-1). 

1 . Race 

Judging from the composition of the groups tested, the TT 
centers served a largely white population, while the BT centers served 
a largely non- white population. Although the number of male and female 
children tested at the two sets of centers corresponded rather closely, 
the racial composition was essentially reversed. 

Five of the TT centers had no Negro children tested, and none of 
the remaining five had more than five Negroes tested. Six of the BT 
centers had exclusively Negro enrollees. In the TT centers, 87-1/2 per- 
cent of the children tested were white. In the BT centers, 88 percent of 
the children were non-white. 

2. Age 

The mean age of the 84 males and 52 females tested in the 

4 

TT centers was 71.5 months (or nearly 6 years). In contrast, the mean 
age of the 82 males and 57 females tested in the BT centers was 55.1 
months (or approximately 4-1/2 years). Thus, the tested population of 
the TT centers was, on the average, 16.4 months older than the BT cen- 
ter children tested. 

In eight of the 10 TT centers, the mean age of the children tested 
was greater than 70 months, while in none of the BT centers did the 
mean age reach 70 months. In fact, in all but one BT center it was 
less than 60 months. 

3 . Degree of Urbanization 

Four of the TT centers were classified as rural, two as sub- 
urban, and four as urban. However, all 10 of the BT centers were urban. 
Interestingly, five of these were located in the same city. 



EXHIBIT E-1 SELECTED CHARACTERISTICS OF THOSE CENTERS WITH THE 

TOP TEN AND THE BOTTOM TEN SUMS OF RAW SCORE MEANS 
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4. Number of Classea 

The TT centers tended to have, on the average, a somewhat 
smaller number of classes (2.2) as compared with the BT centers (2.9). 
Three of the TT centers had only one class, while none of the BT centers 
had only one class. 

8. Class Size 

The mean size of individual classes in the TT and BT centers 
was essentially the same--l6.1 and 16.2, respectively. There was a 
considerable range of class sizes, however. In the TT centers, the 
mean class size ranged from 10.7 to 21 children, whereas in the BT 
centers it ranged from 10.7 to 24.3 children. 

6. Size of Centers 

The TT centers averaged fewer children enrolled "in the 
center" (36) as compared with the BT centers (48.2). With respect to 
population, then, the BT centers were 25 percent larger than the TT 
centers. (This trend could be expected, since it has already been noted 
that the BT centers had a larger mean number of classes. ) 

7 . Length of Program 

Among the TT centers, there were four S, six M, and no 
L centers, while for the BT centers the corresponding figures were 
three, four, and three, respectively. 

The mean length of program in weeks for the TT centers was 
approximately 16, while for the BT centers the figure was 19.5. Thus, 
the BT centers spanned, on the average, 3.5 more weeks than the TT 
centers. However, it should be remembered that the designation of a 
center as of the S, M, or L type may be misleading if the total number 
of hours of the program offered at a particular center is not taken into 
account. With the great variation in daily length of program it was 
possible, for example, for children in certain S centers to be exposed 
to more hours of program than children in some M centers or even in 
an L center. For example, children who attended one TT center--an 
S center of 13 weeks' duration- -were offered 325 hours of program. 
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Yet children who attended one of the BT centers --an L center of 29 
weeks' duration- -were offered only 290 hours of program. 

Therefore, due to the great variation in daily length of program, 
the total number of hours of program by center was computed for the 
TT and BT centers. This figure ranged from a low of 100 hours (a 
TT center) to a high of 750 hours (a BT center). 

The mean number of hours of program was 299.5 for the TT cen- 
ters and 434.4 for the BT centers. 

Only one TT center offered more than 500 hours of program, 
compared with five BT centers which did. However, it should be noted 
that at least one of the TT centers gave a relatively large "dose" of 
program (585 hours), while at least two BT centers gave relatively 
small "doses" of program (165 and 195 hours). Stated somewhat dif- 
ferently, the TT centers operated, on the average, for 3l percent less 
time than the BT centers. 

The exhibits found in this section should be read with the under- 
standing that figures dealing with class size and other aspects of pupil 
attendance are often necessarily only approximations. The enroll- 
ment situation in the Head Start Child Development Centers ranged 
from firm to fluid. In some centers, the pupil accounting was concise 
and current, while in other centers attendance records were more 
loosely kept, and day-to-day enrollment figures were open to various 
interpretations . 

To give some idea of the fluidity of the "population picture" in 
a specific center, let us examine excerpts from the notes of a tester 
who tested a center in a large Eastern city. These excerpts have been 
selected from six handwritten pages of one tester' s notes and are 
quoted directly. 

Attendance at this center is very sporadic. Some of the 
children come once a week, some come twice. Many of 
the children who are still enrolled really do not come. 

It should be noted that because of the sporadic attendance 
many of the children really have not been going for 20 
weeks . 
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On the books, 36 children are enrolled; only 3 of the orig- 
inal 35 have dropped out. However, while this tester was 
at the center there were certainly not 36 children^ Out of 
the 25 numbers given me, 8 of them definitely would not 
be tested because the teachers said they really weren't 
coming (and they really weren't). It was rather difficult 
to get even a rough estimate of the number attending; how- 
ever, in class 01 about 13 (of the 20 enrolled) usually 
came. Of course, this isn' t necessarily the same 13 
every day but 13 is about the number attending. 

In class 02 about 10 to 12 children regularly attend. In- 
cidentally, 'regular' here* involves a value judgment. A 
couple times a week is 'regularly, ' mainly because there 
are a lot who don't come at all. 

In summary, when all the Head Start Child Development Centers 
were ranked in order of "sum of raw score means, " the bottom ten 
centers (as compared with the top ten) tended to be more "big city" 
and more Eastern geographically, of longer duration in terms of both 
number of weeks spanned and hours of program, and larger in terms 
of both number of classes and total population enrolled. 

Tested children of the BT centers tended to be non- white and 
younger than those in TT centers. Tested children of both the BT and 
TT centers had a similar mean number of siblings and were enrolled 
in classes of approximately the same mean size. 

With 1.0 representing no attrition, the TT centers (with an attri- 
tion rate of 0.877) displayed a somewhat greater holding power than the 
BT centers (with an attrition rate of 0.778). 

The TT centers tended to be less urbanized and more Western 
and Southwestern geographically. On the average, they lasted fewer 
weeks, offered fewer hours of program, and contained a somewhat 
smaller number of classes and fewer enrollees. Children were more 
often white and older. 

8. Another Ranking 

Exhibits E-2 and E-3 show those centers with the 10 high- 
est and 10 lowest PPVT IQ means, respectively. Data from tester 
reports have been charted for these two groups of centers. 



EXHIBIT E-2 DESCRIPTION OF CENTERS WITH THE TEN HIGHEST PPVT IQ MEANS 



m 



-1 



PRC R-886 
E-8 



if' 


- * 

5 •» f 
^ jr.s c 

5 < 0 






>o 

•0 

'C 




Difficulties 
With Teacher 
Experienced by 
Tester 


NsJ 


X 

X 

X 

X 

X 

X 

X 

X 

X 


O' 




e 

e 

>* 


'^x 


- 






Easy to 
Establish 
Rapport With 
Children 




X 






e 

e 

>* 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 


o 




Heating/ Ventilation 


e 

1 

1 


X 

X 

X 

X 

X 

X 

X 

X 




Fair | 


X 


- 


1 Poor 1 






- 


Lighting 


1 Adequate | 


X 

X 

X 

X 

X 

X 

X 

X 

X 


O' 


u 

e 

hi 






Poor 1 






O 


Quiet, 

Uninte rrupted 
Testing 
Location 


1 No 






e 

e 

>* 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 


o 


O' 


Sampling 
Procedure 
Varied From 
Routine 


1 No 


XXXX XXXX X 


O' 


e 

e 

>* 


X 




m 


No, of 
Children 
in Each 

Class 


15 
9,5 
ll 

14, 15 
20, 15, 15 

16 

15 

16 

16 to 18 
18 




r- 


No. of 
Classes 
per 
Center 


1 

2 

2 

2 

3 

1 

I 

5 

8 

1 


26 


sO 


No. of 
Originally 
Register^ 
Children 
Who 


li 

n 


15 
7 

22 

29 

38 

16 
12 
58 

135 

17 


349 


m 


Total 

Registered 

and 

Attending 
at Time 
of 

Testing 


15 
12 
22 
29 
37 

16 
15 
58 

137 

18 


O' 

ifi 

lO 






Total 

Children 

Registered 

at 

Start 


17 
8 

25 

31 

41 

18 
20 
60 

209 

18 


447 




fn 


s| 


Es 

L 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 






fM 


SI 

j 


h 

IQ 

X 


<OMsOO^rs|0^r^in h- 

o e^ ^ o* ^ e^ O' ^ 




in 

O' 


- 


i| 

“1 


X, ^XsXk o O'X, r- r- o 

mtnrMor-fn^in^ e* 


1 Totals 


a 

i 

s 



i 









PRC R-886 
E-9 




PRC R">886 
£-10 



Only four of the "top ten PPVT IQ mean center! " are alio among 
those ten centers with the top ten sums of raw score means (PPVT plus 
BI plus PSI plus VSMS). Likewise, only four of the "bottom ten ppVT 
IQ mean centers" are among those ten centers with the bottom ten sums 
of raw score means. 

Different criteria for center ranking, then, result in different 
groupings of centers. And there are many possible criteria for ranking 
of centers. PRC has used but two- -sums of raw score means, and 
PPVT IQ means. 

There is a wide range of PPVT IQ center means shown in Exhibits 
E-2 and E-3 (from a low of 61.00 to a high of 102.67). The mean of the 
PPVT IQ means is 68.63 for the bottom ten centers and 95.46 for the 
top ten centers. There is a difference, then, of nearly 27 mean PPVT 
IQ points between the top ten and bottom ten center PPVT IQ means. 

This difference may have important implications for program con- 
tent and teaching methodology. But it also may be yet further evidence 
of the differential test anxiety hypothesis discussed in Section V. In 
seven of the bottom ten PPVT IQ mean centers, the majority of the 
children tested were non-white, and in three of these seven the children 
tested were exclusively non-white. However, in one of the bottom ten 
centers, all of the children tested were white. In nine of the top ten 
PPVT IQ mean centers, the majority of the children tested were 
white, and five of these nine centers had samples which were ex- 
clusively white. However, in one of the top ten centers, all of the 
children tested were non-white. .The mean age in months of the children 
tested in the top ten PPVT IQ mean centers was. 62.86, as compared 
with 61.94 in the bottom ten. 

Five of the bottom ten PPVT IQ mean centers were located in the 
same city. Two of the top ten centers were located in the same city, 
and one of the top ten centers was located in the same city which con- 
tained five of the lowest centers. This was the only top center with a 
totally Negro testing sample. Thirteen of the 14 children tested were 
male. Tester reports did not indicate a particularly outstanding pro- 
gram in evidence at this center. It is possible that these enrollees 
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were jimply lest deprived to begin with. Their mean age of 54.14 
months was the lowest of any of the top ten centers. 

It can be seen that in six of the ten bottom centers, sampling pro- 
cedures varied from the routine (i.e., original sample lists), while in 
only one of the top ten centers did such variation occur. The variation 
involved replacement sampling forced by absences or attendance irreg- 
ularities of the children in original sampling lists. 

Ranking of the centers according to mean PPVT IQ results in a 
considerably different grouping of centers as compared with ranking of 
the centers according to the sum of raw score means on four tests. 
PPVT IQ center means ranged from "retarded'* to "average. " 

In terms of tested children, all but one of the top ten PPVT IQ 
mean centers were predominantly white, while seven of the bottom ten 
PPVT IQ mean centers were non- white. However, again in terms of 
tested childr 'll. one top center was exclusively non-white and one bot- 
tom center was exclusively white. The children in the top ten centers 
were, on the average, approximately 1 month older than bottom center 
children. 

2 

Testers reported necessary variations from the routine in more 
sampling procedures in the bottom ten centers. 

9. Some Detailed Findings 

Exhibit E-4 presents further selected test data of the TT 
and BT centers. Column 2 shows the sums of the raw score means of 
those ten centers which ranked highest on this criterion, and those ten 
centers which ranked lowest. 

In column 4, it can be observed that none of the BT centers had a 
PPVT raw score mean equal to the lowest PPVT raw score mean of the 
TT centers. Stated differently, the lowest PPVT raw score mean of 



^See Exhibits 5 and 6 for further description of untested children. 

^That is, more requirements to use the alternative procedures supplied 
for such purposes. 
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the TT centere wai 42.5, while the highest PPVT raw score mean of 
the BT centers was 36.61. The mean of the PPVT raw score means 
for the TT centers was 17.6 points higher than for the BT centers. It 
can be seen from column 13 that the mean of the PPVT IQ means was 
88.82 for the TT centers and 73.88 for the BT centers, a difference of 
nearly 15 mean IQ points. Four of the TT centers had PPVT IQ means 
above 90, while the highest PPVT IQ mean of any BT center was 83. 

The lowest PPVT IQ mean for any TT center was 77.27, while three of 
the BT centers had PPVT IQ me^ns below 70. Seven of the ten TT PPVT 
IQ means were higher than any of the BT PPVT IQ means. 

On the other hand, it is interesting to note (column 12) that the 
mean of the VSMS Social Quotient means for the BT centers was slightly 
higher than that for the TT centers. The children from the BT centers, 
then, had considerably lower PPVT IQ's, on the average, than the TT 
children, but averaged slightly higher VSMS Social Quotients. 

Column 8 indicates that the highest PSI raw score mean for any 
BT center was 41.54, while the lowest PSI raw score mean for any TT 
center was 52.21. The mean of the PSI raw score means for the TT 
centers exceeded that for the BT centers by approximately 24 points. 

The mean of the BI raw score means for the TT centers was ap- 
proximately 10 points higher than for the BT centers. One BT center 
(68) had a BI raw score mean which was exceeded by only two TT cen- 
ters, and one TT center (41) had a BI raw score mean which was ex- 
ceeded by all but two BT centers; generally, however, TT BI raw score 
means exceeded those of the BT centers. 

It can be seen that the TT center (41) with the low BI raw score 
mean had very high raw score means on the other three tests, while 
the BT center (68) with the high BI raw score mean had low raw score 
means on the other three tests. These findings are difficult to interpret. 

Column 2 of Exhibit E-4 shows the sums of raw score means for 
the top ten and bottom ten centers. It must be stated that nearly 50 
percent of the sums comes from the BI raw score means. 
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10. Summary 

Those centers with the top ten sums of raw score means, 
ai compared with tho.e center* with the bottom ten ium* of raw .core 
mean*, exhibited the following charade rietic*: higher PFVT raw 
*core mean* in all ca*e*; a mean of PPVT raw ecore mean* 17.6 point* 
higher; higher PPVT IQ mean* in *even of 10 ca*e*; a mean of PPVT 
IQ mean* higher by nearly 15 point*; no PPVT IQ center mean below 
77,27; a elightly lower mean of VSMS Social Quotient mean*; a mean of 
PSI raw score mean* approximately 24 point* higher; higher PSl raw 
score center mean* in all cases; and a mean of BI raw score mean* 
approximately 10 points higher. 

C. Tester Informal Reports and Narrative^ 

As part of their activity in the field, Head Start testers wrote in- 
formal reports and narratives concerning each Child Development Cen- 
ter visited. These materials comprise, in effect, brief descriptions of 

the testers' experiences at the site. 

Since the major task of the testers was not a comprehensive eval- 
uation of center faci’^tieE and functionings, they were not specifically 
trained for objective and extensive observation of centers and surround- 
ings. However, perusal of tester informal reports and narratives can 
provide one with the anecdotal impressions of a particular center as 
seen by an "outsider" --a person not directly connected with the center 

operation. 

Material concerning the previously identified "top ten" (XT) cen- 
ters and "bottom ten" (BT) centers, in terms of sum of raw score 
means, has been extracted and organized under several headings (see 
Exhibits E-5 and E-6). 

The testers' narratives did not follow a standard format, and 
there are gaps in the chart. The testers' wording has been shortened 
and. in some cases, paraphrased to fit into limited space. 

1 . Testing Conditions 

It can be noted that testing conditions in four of the TT cen- 
ters were described in positive terms. No negative testing conditions 
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EXHIBIT E-6 ANECDOTAL MATERIAL FROM TESTERS' REPORTS OF THOSE CENTERS WITH 

THE BOTTOM TEN SUMS OF RAW SCORE MEANS 
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were reported at these centers, while in the BT centers, negative 

testing conditions were reported in at least three cases and positive 
conditions in one. 

2. Personnel 

In five of the TT centers, personnel were described spon- 
taneously as either cooperative or "good, " with no negative impres- 
sions of TT center personnel recorded. The picture for the BT centers 
is considerably different. For at least one-half of these, testers had 
negative comments to make regarding the staff, and used such terms as 
"unprofessional, " "very poor, " and "indifferent. " It should be added 
that in three of the BT centers, testers were positively impressed by 
staff cooperativeness. For several of the BT centers, testers made 
mention of noise and confusion. In several of the BT centers, mainte- 
nance of "discipline" seemed to be a major problem. 

Resistance on the part of teachers to testing activity was noted in 
the two BT centers. Apparently this attitude stemmed from testing 
that had been done prior to arrival of the project tester. 

3. Community 

When the tester "community" comments for the TT centers 
are compared with those for the BT centers, it is found the latter cen- 
ters were more often located in areas of a slum, " run-down" nature, 
with greater ethnic mix. 

4. General 

Positive comments regarding the nutritional component of 
the Head Start program were made by testers at three centers (one TT 

and two BT's). There were no negative comments concerning nutrition 
recorded. 



5. Summary 

In summary, testing conditions, personnel, learning envi- 
ronment, and neighborhood were reported as inadequate more often for 
the BT centers than for the TT centers. 
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The global impression of the BT centers derived from the tester 
reports is one oi more noise and distraction and less effective staff- - 
of more pupil behavior not under control. For example, at one BT 
center "Discipline problems with children interfered with the success 
of classroom activities, " according to the tester. At a second BT 
center, "Classes were often mass chaos. " At a third BT center, the 
tester reported "Noise, constant interruptions, and general disorder. " 
No comments of this type were recorded for the TT centers. 

Material from testers' informal reports and narratives concerning 
those centers with the top ten and bottom ten PPVT IQ means has been 
extracted and arranged under four headings in Exhibits E-7 and E-8. 
Blanks indicate that directly applicable material for a particular heading 
was not recorded by the tester, or was included/in Exhibit E-5 or E-6. 

D. Ranking Based on Center Residuals 

The index of ranking in this method is, as described earlier, the 
center residual. Residuals were calculated for each dependent variable. 
Then the highest positive ten and the lowest negative ten centers, based 
on an unweighted composite or sum of residuals for the four tests, were 
located^ and examined with respect to several types of information. 

The results are shown in Exhibit E-9. The centers involved are those 
with a check mark in the second column. They are given arbitrary num- 
bers from 1 to 10 in the first column, and are starred if they were also 
in the top (or bottom) ten, based on the unadjusted composite index. 

A composite index, consisting of the ten highest and lowest of the 
residuals for the PPVT center average raw score, was also used. The 
high and low centers thus identified were those with a check mark in 
the third column. 

The results largely seem to speak for themselves. Among the 
bottom ten centers based on total composite residual value, six also 
had the lowest PPVT residual values. The average center size (with 
3.4 classrooms) was slightly higher than that of the top ten centers. 



^See Exhibit B-3 in Appendix B for a plot of PPVT center residuals 
against fitted values. 



EXHIBIT E-7 ANECDOTAL MATERIAL FROM TESTERS' REPORTS OF THOSE CENTERS 

WITH THE TOP TEN PPVT IQ MEANS 
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The number of all white or non-white centers was the same as 
the top ten, with a higher proportion of non-whites in the mixed centers. 
The testers' comments were favorable about eight of the centers, while 
six of the centers were cited for uncooperative or poor teachers and/or 
poor facilities. 

No single significant factor or condition seems to stand out or to 
be attendant upon a center' s high or low mean score. As has been 
pointed out, the absence of pre-testing has precluded any basis for 
comparison of relative gains or Tosses of the children during the course 
of the program. 

E. Intracenter Class Differences 

A third way of examining centers is to compare performance be- 
tween classes within a center. The overall study was not specifically 
designed for this purpose. Nevertheless, sampling within centers 
tested was based on an approximately equal sample of children drawn 
from each class. In reality, of course, such was not always the exact 
result, owing to absences, etc. 

To examine the effect of different classes within centers, all of 
the centers with two classes and all those with four classes in PRC s 
total sample were listed. Five centers from each group were then 
drawn at random. For each of these sample centers, a single classifi- 
cation analysis of variance was made for each of the four main tests 
(PPVT raw score, PSI total score, VSMS raw score, and BI total 
score). The results are given in Exhibit E-10. The exhibit shows, 
for each sample CDC analyzed, the number of classes, the tests used 
as dependent variables, the urbanization classification of the CDC, the 
type of CDC (S, M, or L), the percentage of non-white children in the 
CDC sample tested, the average age of the sample of children, and 
the total number of children in the sample. It also shows, for each 

dependent variable, the estimate of the treatment variance a. 

A2 

and of the child or error variance a , the ratio of between-mean 

e 

squares to within-mean squares, and the F-ratio with an indication by 
asterisk if the ratio was significant at the 0.05 level. 



EXHIBIT E-10 ANALYSIS OF CLASSROOM EFFECTS WITHIN A SAMPLE OF CENTERS 
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Note; (1) An asterisk indicates that the value is significant at the 0,05 level. 
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It is apparent from Exhibit E-10 that the only significant dif- 
ferences in the two samples of GDC's occurred with the VSlvIS raw 
score as the measure of performance. It is true that the sample sizes 
within CDC's and classes are small indeed. Thus, only relatively 
large differences were likely to have been detected. 

These results suggest that a wide range of factors associated 
with, or assignable to, different classes within CDC's may have no 
appreciable effect on the performance or status of the children as 
measured by these instruments.. Viewed otherwise, the amount of 
treatment variance is generally small relative to the total center var- 
lance ( CT^ / ^ • 

There are a number of considerations to be borne in mind in 
evaluating the results shown in Exhibit E-10. The interpretation of the 
treatment variance, for example, is not unambiguous.. No attempt was 
made here to develop an independent criterion to measure teacher dif- 
ferences or other classroom differences. Also, testers and treatments 
are confounded, since the same tester generally tested each class in 
a given CDC. This was not always the case, but in no case did two 
testers ever test the same children. 

There was one case in which the CDC had divided its classes be- 
tween two buildings. Two testers were assigned to the center. One 
tester tested children from the two classes in one building; the other 
tested samples from the three classes in the other building. An anal- 
ysis of variance of scores for each of the four measures, classified 
by tester, was made. None of the results of the F-tests was signi-^xant 
at the 0.05 level. For the same center, the F- ratios for the PPVT and 
the BI were significant at the 0.05 level when analysis of variance was 
made for classes. The implication is that there was some effect of 
the variables associated with classes; but not with the testers, con- 
tributing to the results for this center. Thus, there is some evidence, 
at least, that tester effects may not have been a serious source of bias 
in the data of this study. This point is discussed and analyzed further 
in Section V. 
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It is possible that the significant F-ratios for the VSMS measure, 
when they occurred, were the result of a tester (interviewer )/teacher 
interaction rather than a teacher/pupil effect. Significant F-ratios for 
other depenjient variables could have resulted from the systematic as- 
signment of children of similar ages to the same class. This apparently 
was not the case in these samples. However, this would be a source of 
confounding that should be considered by anyone attempting a more re- 
fined study along these lines. The factors contributing to the "within" 
variance are many- -age, sex, race, length of time in program, and so 
on--since it was not feasible to make analyses controlling for these and 
other background factors within CDC s. 

It is tempting to conclude that the effect of the individual teacher 
was not very great. However, such a conclusion would have to be based, 
at the very least, on the assumption that the different teachers, even 
within a CDC, were attempting to achieve the same goals, or at least 
the same specifiable and measurable goals. It should also be supported 
by outside or other evidence that there were substantial differences be- 
tween teachers on relevant variables. In PRC s data, the range of 
variation has not been examined in any systematic way. 

We cannot draw a final conclusion about teacher effects in Head 
Start programs from the available data. We can only incline toward 
acceptance of the hypothesis that the immediate effects are not dra- 
matically large with the classes and children studied. ^ 



This conclusion is consistent with findings of Coleman, et al, about the 
relative size of teacher effects in verbal achievement of first graders. 
(See U.S. Office of Education, OE-38001, Equality of E^ducational Op- 
portunity, James S. Coleman, et al, 1966; especially Table 3.25.1, 

frnTTT 
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APPENDIX F 

THE DRAW-A-PERSON TEST RESULTS 



In addition to the four major evaluation measures (PPVT, PSI, 

BI, and VSMS) a Draw-A-Person (DAP) test was administered to each 
child. Although the DAP has been widely used as a measure of intel- 
ligence and has major advantages in that it is relatively simple and in- 
expensive to administer and score, little is known about its validity 
for young disadvantaged children. 

PRC 's sample of Head Start children provided an opportunity to 
estimate the validity of the DAP test for young children from low -income 
families by obtaining DAP test scores and comparing them with PPVT 
and other scores. 

After the completion of the PSI and the PPVT, the tester gave the 
child a blank sheet of paper (8-1/2 by 11 inches) and said, "Draw a per- 
son. Draw the best person you can. " If the child asked whether he 
should draw a man or a woman, the tester replied that the child should 
draw whichever he wished. In the few instances when the child indicated 
that he did not know what a "person" was, the tester explained by point- 
ing out familiar examples of persons or people, such as the teacher, a 
parent, the child himself, etc. Of the 964 tests obtained. 111 were seen 
as drawings of women, in that the figure had long hair, was obviously 
clothed in a dress, or was described by the child himself as a female 
(e.g. , "my mommy, " "my sister," or " a lady"). Two hundred forty- 
seven drawings were unrecognizable (Class A), and seven of these were 
nonresponses in which the child did not put even one mark on the test 
paper. The remaining 606 drawings were considered to be drawings of 
men. 

The scoring procedures and criteria used were those presented 
by Harris (1963). ^ Drawings of women were scored according to the 



^Dale B. Harris, Children's Drawings as Measures of Intellectual 
Maturity. New York: Harcourt, Brace and World, Inc., 1963 



PRC R-886 
F-2 



Draw-A-Woman criteria, and drawings of men were scored according 
to the Draw-A-Man criteria. Blank or unrecognizable drawings were 
marked Class A and received a raw score of 0 or 1, in accordance with 
Harris' criteria. As noted above, only 7 of the 247 Class A. drawings 
were blank. 

Scoring of the drawings was done by a team of four scorers.^ It 
was decided that each of a sample of drawings should be scored inde- 
pendently by each of the four scorers until sufficiently high interscorer 
reliabilities were obtained. Exhibit F -1 indicates the interscorer reli- 
ability coefficients that were obtained on a sample of drawings before 
the scorers began dividing the scoring task so that each drawing would 
be scored only once. Note in Exhibit F-1 that correlations were com- 
puted for Class B drawings only, as well as for Class A and Class B 
combined. Since Class B drawings always had a score of 2 or more 
and Class A drawings always had either a score of 0 or 1, it was felt 
that a correlation between Class B drawings would yield a more rigorous 
check on the scorers' agreement. Exhibit F-1 also includes an inter- 
scorer correlation coefficient which was obtained on a small sample of 
drawings after about half the drawings had been scored. It can be seen 
that all interscorer correlation coefficients were 0.89 or above. 

Exhibits F-2, F-3, and F -4 present the distributions of DAP raw 
scores by children's sex, race, and age, for Short-term, Medium- 
term, and Long-term centers. These exhibits follow the same format 
as Exhibits 19A through 22C in subsection IV.B.l.a. 

Exhibit F-5 presents intertest product-moment correlation coeffi- 
cients among the DAP raw score (RS), DAP IQ, PPVT raw score (RS), 

PPVT IQ, and PSI raw score (RS). Parts A through F of the exhibit present 



Scorers were: Miss Marilyn Danner, a psychology major who had ex- 
tensive experience in scoring drawings for a research project xmder the 
direction of Dr. Dale Harris and Dr. Elizabeth Kirchner at Pennsylvania 
State University; Dr. Lois-ellin Datta, psychologist in the Personality 
Section at National Institute of Mental Health; MU.ss Carol J. Kline, 
doctoral candidate in clinical psychology at the University of Maryland; 
and Dr. Ruth Ann O'Keefe, PRC educational psychologist. 
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EXHIBIT F-1 INTERSCORER RELIABILITY FOR PERSON 

DRAWINGS 



Scorer 


All Drawings » 

in Small Sample' ' 


(2) 

Class B Drawings Only' ' 


B- C 


0.979 


0.963 




(N = 42) 


(N = 27) 


B-D 


0.989 


0.984 




(N = 50) 


(N = 32) 


D- A 


0.989 


0.982 




(N = 42) 


(N = 25) 


1 

Q 


0.976 
(N = 15) 




0 

1 


0.894 
(N = 14) 





Notes: (1) Class A and Class B 

(2) Class B drawings are recognizable drawings which obtain 
a raw score of 2 or more. Class A drawings are blank 
(i.e., non-drawings) or unrecognizable, and can obtain 
raw scores of 0 or 1. For this category, Class A draw- 
ings were eliminated from the computations. 

(3) These correlation coefficients were obtained after 
approximately one-half of the total number of drawings 
had been scored. 
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EXHIBIT F-3 DISTRIBUTION OF DAP RAW SCORES FOR CHILDREN IN MEDIUM- 

TERM CENTERS, BY SEX, RACE, AND AGE 
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Not*: (1) An a*t«ri*k d*not«* an ineligible child, who was in the program for less than 17 weeks. 
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( 1 ) An asterisk dsnotes an iaaligibls child, who was in the program for less than 17 weeks. 
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EXHIBIT F-4 DISTRIBUTION OF DAP RAW SCORES FOR CHILDREN 

IN LONG-TERM CENTERS, BY SEX, RACE, AND AGE 





Mai* 




Whit« 




Age Range 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 


6-5 






01*U) 


• 


01 


01 


00 


04 


M 












01 


01* 


08* 














04 


01 
















04 


01* 
















05* 


03 
















05 


06 
















06 


12 
















07 


14 
















* 07* 


15 
















08 


















11 










Total 


















N 


1 


• 


1 


11 


t 


2 






Moan 


1.00 


• 


1.00 


5.34 


5.19 


4.00 


m 




Modiaa 


1.00 


m 


1.00 


5.00 


3.00 


4.00 


m 


m 


El^ibl* 


0 




1 


t 


7 


1 






M*an 


0.00 


- 


1.00 


5.22 


7.29 


4.00 




m 




0.00 


- 


1.00 


5.00 


4.00 


4.00 


- 


• 



Not*: (1) Am a*t*rl*k d*not»* am ixMligiU* ckil4, who waa in th* program for l*a* titan 25 w**ka. 
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Male 




Non- White 




Age Range 




3.0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6-6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 


6-5 


6-11 




02 


00 


01 


01 


01 


01 


13 






02*(U 


00 


01 


01* 


01 


05* 










00 


01 


01 


01* 


06 










01 


01 


01 


01* 


08 










01 


01 


01 


01 












01 


01 


01 


03 












01* 


01* 


01 


04* 












01* 


03 


01 


05 












01* 


04* 


01 


05 












01 


05 


03* 


05 












01* 


05 


03 


05 












03* 


06 


03 


06 












05 


06 


03 


06 












09 


08* 


03 


06 












09* 


12 


03 


06 












09 




04 


06 
















04 


07 
















04 


07* 
















04 


07* 
















05* 


08 
















05 


08 
















05 


09 
















05 


10 
















05 


11 
















05 


11 
















05 


12 
















05* 


12 
















06 


12 
















06 


13 
















06* 


18 
















06 


21 
















06 


















06 


















06 


















06 


















07* 


















07 


















07 


















07 


















07 


















08* 


















08* 


















08* 


















10* 


















10 


















10 


















10* 


















11 


















12 


















14 


















15 


















16 


















24 










Total 


















N 


2 


16 


15 


53 


31 


4 


1 


- 


Maan 


2.00 


2.69 


3.73 


6.08 


7.35 


5.00 


13. Ou 


• 


Median 


2.00 


l.OO 


3.00 


5.00 


6.00 


5.50 


13.00 


m 


EUgihU 


















N 


1 


10 


12 


42 


26 


3 


1 




Mean 


2.00 


2.70 


3.58 


5.98 


8.00 


5.00 


13.00 




Median 


2.00 


1.00 


2.00 


5.00 


6.50 


6.00 


13.00 


- 




Nol;«: (1) An a«t«ri«k d«not«s an inallgibl* child, who was in th« program for latt than 2$ weak*. 
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EXHIBIT F-4 



(Continued) 





Famala 




Whito 




Ago Ranga 




3-0 


3-6 


4-0 


4-6 


5-0 


5-6 


6-0 


6 — 6 




3-5 


3-11 


4-5 


4-11 


5-5 


5-11 
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6-11 
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Total 


















N 


m 


2 


2 


3 


7 


1 


m 




Moan 


m 


3.50 


5.50 


7.00 


8.29 


12.00 


% 




Moiiaa 


m 


3.50 


5.50 


7.00 


9.00 


12.00 


- 


* 


El^bio 


















m 


2 


2 


3 


5 


1 






Moan 




3.50 


5.50 


7.00 


8.00 


12.00 


«■ 




Modiaiii 


* 


3.50 


5.50 


7.00 


9.00 


12.00 


- 


- 



Not«: (1) An act«rlftk d«not«f an Inaligibla child» who wu in th« program for !••• 25 woaki. 
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F«mal« 






>'3n-Whlta 
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3-0 
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01 
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14 
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02* 
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01 












03 


02 


01 


02 














04 


01* 
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04 


01* 


03 














04 


01* 


04 














05* 


01 


04 
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01 


04 














06 


• 02* 


05 














07* 


03* 


05* 














07* 


03 


05 














07* 


03* 


05 














01 


04 


06* 














13 


05* 


06 














14 


05 


06* 














14* 


06 


07 
















06 


OS 
















06* 


OS* 
















06* 


OS 
















06 


OS 
















07 


09 
















07 


10 
















01* 


10 
















01 


10* 
















01* 


10 
















Ot 


11 
















09 


11 
















09 


11 
















09 


11 
















10* 


12* 
















10 


13 
















10 


13 
















10* 


13* 
















11 


13* 
















11 


13 
















12 


14 
















12 


IS 
















12 


15 
















13* 


17* 
















13* 


20* 
















13 


22 
















13 


24 
















15 


















IS* 


















IS* 










Total 


















N 


3 


6 


19 


4S 


45 


4 


1 




Maan 


2.33 


1.17 


5.51 


7.06 


S.93 


8.75 


15.00 




Madiaa 


2.00 


1.00 


5.00 


7.00 


8.00 


10.00 


15.00 




EUfibla 


















N 


3 


4 


10 


31 


34 


4 


1 




Maan 


2.33 


1.50 


5.60 


6.87 


8.56 


8.75 


15.00 




Madian 


2.00 


1.00 


4.00 


7.00 


8.00 


10.00 


15.00 





Note: (1) Aa astarisk daootea aa iaalifibla child, who was in tha program for lass than 25 waaks. 
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EXHIBIT F-5 



INTERTEST CORRELATIONS FOR TOTAL AND SUB- 
GROUPS: PPVT RAW SCORE, PPVT IQ, DAP RAW 
SCORE, DAP IQ, AND PSI RAW SCORE 



A. 



B. 



C. 



PPVT RS 
PSI RS. 
PPVT IQ 
DAP RS 
DAP IQ 



PPVT RS 
PSI RS 
PPVT IQ 
DAP RS 
DAP IQ 



PPVT RS 
PSI RS 
PPVT IQ 
DAP RS 
DAP IQ 



Note: (1) This includes all children for whom scorable results on all 
five measures (PPVT, PSI. DAP, VSMS, and BI) were 

obtained. 



1 All Children (N 


= 956)^^' 






1 PPVT 


PSI 


PPVT 


DAP 


DAP 1 


1 RS 


RS 


IQ 


RS 


IQ 1 


1.00 


0.73 


0.84 


0.46 


0.31 1 




i;oo 


0.53 


0.56 


0.36 






1.00 


0.25 


0.29 








1.00 


0.85 1 










1.00 


1 Fema 


lie. White (N = 199) 




PPVT 


PSI 


PPVT 


DAP 


DAP \ 


1 RS 


RS 


IQ 


RS 


IQ 1 


1.00 


0.7 8 


0.79 


0.50 


0.40 1 




1.00 


0.59 


0.58 


0.43 






1.00 


0.23 


0.32 1 








1.00 


0.88 1 










1.00 1 


1 Female 


5 . Non- White (N = 263) 




PPVT 


PSI 


PPVT 


DAP 


DAP 1 


RS 


RS 


IQ 


RS 


IQ 1 


1.00 


0.65 


0.83 


0.44 


0.28 1 




1.00 


0.40 


0.54 


0.30 1 






1.00 


0.24 


0,30 I 








1.00 


0,84 1 










1.00 
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EXHIBIT F-5 (Continued) 



/ D. 


Male, White (N 


= 188)*^^ 








PPVT 


PSI 


PPVT 


DAP 


DAP 




RS 


RS 


IQ 


RS 


IQ 


PPVT RS 


1.00 


0.69 


0.88 


0.40 


0.23 


PSI RS 




1.00 


0.07 


0.51 


0.13 


PPVT IQ 






1.00 


0.20 


0.19 


DAP RS 








1.00 


0.89 


DAP IQ 










1.00 




Male, 


Non- White (N = 274) 








PPVT 


PSI 


PPVT 


DAP 


DAP 




RS 


RS 


IQ 


RS 


IQ 


PPVT RS 


1.00 


0.72 


0.87 


0.51 


0.36 


PSI RS 




1.00 


0.55 


0.56 


0.39 


PPVT IQ 






1.00 


0.37 


0.36 


DAP RS 








1.00 


0.88 


DAP IQ 










1.00 


F. 


Children Tested in Spanish (N = 


65) 






PPVT 


PSI 


PPVT 


DAP 


DAP 




RS 


RS 


IQ 


RS 


IQ 


PPVT RS 


1.00 


0.72 


0.86 


0.53 


0.47 


PSI RS 




1.00 


0.52 


0.60 


0.54 


PPVT IQ 






1.00 


0.41 


0.44 


DAP RS 








1.00 


0.96 


DAP IQ 


• 








1.00 



Note: (1) Computations include only white boys tested in English 
^ and omit white boys tested in Spanish. 

I 
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the intertest correlations for (A) all children, and subgroups of (B) 
white females only, (C) non-white females only, (D) English-speaking 
white males only, (E) non-white males only, and (F) children tested in 
Spanish. 

Exhibit F-6 presents, for 3-, 4-, 5-, and 6-year-old children 
separately, intertest correlation coefficients among the DAP and PPVT 
raw and IQ scores and PSI raw scores. 

Exhibits F-5 and F-6 show that the correlations between DAP and 

PPVT raw scores for all subgroups compare favorably with the 0.39 

obtained in a large sample of children in the summer 1965 Head Start 

program.^ The DAP-PPVT IQ correlations, ranging from .19 to .44, 

2 

were generally lower than the raw score correlations. 

Exhibit F-7 presents unadjusted raw score means, IQ's, and 
SQ's derived from scores of the PPVT, DAP, and VSMS, as well as 
standard deviations of all means, for the 956 children who completed 
all five measures (PPVT, PSI, DAP, VSMS, and BI). The scores and 
standard deviations are organized by sex and race of children tested. 

The exhibit shows that all mean IQ estimates (PPVT and DAP) 
for all groups were lower than means of normative samples. (The 
PPVT IQ means ranged from 79.31 to 87.27, and the DAP IQ means 
from 77.22 to 78.71.) The low PPVT scores are consistent with the 
verbal handicap frequently reported for children from low -income fami- 
lies, but the finding that DAP IQ's for all groups were even lower than 
PPVT IQ's, was unexpected. For example, five groups of children 
(sample sizes ranging from 17 to 500) in the summer 1965 Head Start 
project were tested on both the PPVT and the DAP. In all five cases. 



Planning Research Corporation, PRC R-795, Results of the Summer 
1965 Project Head Start, H.R. Cort, Jr., et.al. May 1966, Vol. I, 
p. tv -^8. (Report prepared for the Office of Economic Opportunity 
under Contract OEO-753. ) 

2 

Dr. Lois-ellin Datta was responsible for processing and analysis of 
DAP scores and intertest correlations. She is preparing three separate 
papers based on DAP results from the PRC sample. See footnote 2, 
page F-17. ) • * 
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EXHIBIT F-6 ^ INTERTEST CORRELATIONS FOR THREE-, FOUR-, 

FIVE-, AND SIX- YEAR-OLD PPVT RAW SCORE, 
PPVT IQ, DAP RAW SCORE, DAP IQ, AND PSI 
RAW SCORE 



Tests 


Age 


3 


4 


5 


6 


Total 


N 


72 


407 


346 


131 


956 


PPVT RS/PSI 


0.48 


0.70 


0.63 


0.71 


0.73 


PPVT RS/PPV 


T 












IQ 


0.89 


0,95 


0.91 


0.98 


0.84 


PPVT RS/DAP 












RS 


0.38 


0.35 


0.31 


0.38 


0.46 


PPVT RS/DAP 












IQ 


0.37 


0.35 


0.34 


0.42 


0.31 


PSi/t^pvt IQ 


0.34 


0.64 


0.60 


0.68 


0.53 


PSI/ DAP RS 


0.39 


0.41 


0.45 


0.44 


0.56 


PSI/ DAP IQ 


0.38 


0.40 


0.45 


0.46 


0.36 


PPVT IQ/ 












DAP RS 


0.24 


0.29 


0.30 


0.34 


0.25 


PPVT IQ/ 












DAP IQ 


0.22 


0.28 


0.33 


0.39 


0.29 


dap/rs 












DAP IQ 


0.99 


0.98 


0.98 


0.98 


0.85 



EXHIBIT F-7 RAW SCORE IQ AND SQ MEANS AND STANDARD DEVIATIONS 
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Note; (1) This figure is less than the actual total of 964, due to blanks on the DAP tests. 
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the DAP IQ estimate was higher than the PPVT IQ estimate (mean DAP 
IQ range was 76 to 111, and mean PPVT IQ range was 68 to 91). ^ 

Dr. Lois-ellin Datta has undertaken three separate studies in- 
volving the DAP test results obtained from the PRC samples; one of 
these examines in some detail the findings shown in Exhibits F-5, F-6, 
and F-7. The other two studies, co-authored with Ann Drake of the 
National Institutes of Health, concern "Objective Sexual Differentiation 
in the Drawings of Preschool Children" and "The Smiling Faces: A 
Comparison of the Effect of Drawings of Children from Middle Class 
and Very Low Income Families. " The former has been submitted to 

o £ Consu lting Psychology, and is available in mimeographed 
form from Dr. Datta. The latter, which will incorporate a study of 

motivational factors relating to performance on the DAP, is currently 
in preparation. • 
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Datta, Lois-ellin, "The Draw-A-Person Test as a Measure of Intel- 
ligcxice in Preschool Children from Very Low Income Families, " 
j£-^nal of Consulting P sychology, to be published in April or June I 968 . 
(Mimeographed copies are currently available from: Dr. Lois-ellin 
Datta, 2N256, Bldg. 10, National Institutes of Health, Bethesda. 
Maryland 20014, ) 



